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Nature: the truth 


Myths always circulate about Nature’s editorial processes and policies. Here is an attempt 


to dispel them. 


yth 1: Perhaps the longest-lived myth is that publishing 
M:: preprint of a paper submitted to this journal will 

pre-empt its consideration. Not true, as has been said in 
these columns before. For more than 20 years, we have had a policy of 
treating preprints as equivalent to conference talks: intra-researcher 
communication that encourages informal feedback and leads to 
better papers. 

Myth 2: Nature journals do not want senior researchers to involve 
junior colleagues in the confidential process of peer review. Untrue. 
We positively encourage such involvement, to help graduate students 
and postdocs gain experience with due oversight. We ask that they be 
identified, give them credit and may well go to them directly for advice 
on subsequent papers. 

Myth 3: Referees can veto papers. Only on technical grounds. It has 
always been the editors who select which papers Nature publishes, 
even though referees’ assessments of significance are influential. We 
always heed technical comments, but reserve the right to disagree with 
a referee's recommendation as to whether publication is warranted. 

Myth 4: The authorship of a paper — including country and 
institution — influences Nature’s decision on whether to referee or 
publish it. Untrue. We frequently publish papers from first-time 
authors, and frequently reject papers by highly reputable research- 
ers on purely editorial grounds of the paper's significance. We offer 
the option of double-blind peer review for those authors who want it. 
We recognize the possibility of unconscious bias. 

Myth 5: Nature editors choose papers for anticipated media coverage 
or citations. Not true. Assessment of significance is what counts. In 
many areas of research, citations do indeed reflect significance, and 


we value such achievements. But many papers that we publish neither 
achieve nor are expected to achieve high numbers of citations. We value 
them, nevertheless, because we judge them to have intrinsic interest, or 
because of their potentially substantive impact on society. 

Myth 6: Nature editors sometimes reject papers without reading 
them fully. Untrue. 

Myth 7: Authors must prepare submitted manuscripts in the form 
consistent with our highly Nature-specific format guidelines. Not true. 
For submission purposes, we care only that the paper conforms roughly 
to our length stipulations, and that editors and referees can understand 
the claims and their bases. Figures and their legends do not have to be 
placed at the end of the text at submission stage. Only moving towards 
publication does the formatting matter. 

Myth 8: Within the Nature journals system, in which authors may be 
offered a transfer ofa rejected paper to another journal, the transferred 
paper may be underestimated by the receiving editors. Untrue. Editors 
assess papers on their own terms. Because such transfers are informed by 
our knowledge of our journals criteria, one would expecta substantially 
lower rate of prompt editorial rejections and a higher rate of refereeing 
for such transfers than for direct submissions — which is indeed borne 
out by our statistics. For example, manuscripts transferred within the 
Nature family were sent to external reviewers in February 2018 twice as 
frequently as those submitted directly. 

Myth 9: Nature editors never consider appeals. Not true. 

Our Guide to Authors may not be sufficiently clear on some of these 
policies, and we are working to improve it. Above all, we hope that 
this Editorial will help researchers, and correct sometimes widespread 
misconceptions about Nature’s processes and policies. = 


Cries for help 


An outpouring on Twitter highlights the acute 
pressures on young scientists. 


oor mental health is an issue for many of our readers, as 

underscored by the response to a tweet sent by @NatureNews last 

week, which highlighted rates of depression and anxiety reported 
by postgraduate students (see go.nature.com/2gtjxq). The reaction blew 
us away: more than 1,900 retweets and around 230 replies. 

“This is not one dimensional problem. Financial burden, hostile 
academia, red tape, tough job market, no proper career guidance. 
Take your pick,’ read one. “I'd love to see some of the comments 
under this thread published,’ wrote one responder. “There needs 
to be real conversation about this, not just observation.” We 


agree — which is why we are publishing some of the responses. 

There is a problem with the culture in science, and it is one that loads 
an increasing burden on the shoulders of younger generations. The 
evidence suggests that they are feeling the effects. (Among the tweets, 
one proposed solution to improving the PhD: “treat it like professional 
training instead of indentured servitude with no hope ofa career at the 
end?”) It will take a while to change that culture — and, unfortunately, 
it will probably take almost as long for some in the community to real- 
ize the need for it to change. But change it must. 

We intend to revisit this topic, starting in May, with a Careers 
Feature on depression. We want to hear more from readers on mental- 
health issues and the stresses that contribute. You can share your 
stories in confidence here: go.nature.com/stress-stories. 

We thank those who have already told theirs. “I hold down three 
jobs to fund my PhD, living in hopes of funding, it’s a constant strain,” 
wrote one. “So many others out there like me, and sometimes I wonder 
if it's even worth it. The research community will lose so many great 
minds to issues like this. It needs to be changed?” m 
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to improve the delivery of public services, and even to track 

and fight modern slavery. No wonder researchers around 
the world — including members of my own organization, the Royal 
Statistical Society in London — have had their heads in their hands 
over headlines about how Facebook and the data-analytics company 
Cambridge Analytica might have handled personal data. We know 
that trustworthiness underpins public support for data innovation, 
and we have just seen what happens when that trust is lost. 

Allegations that Cambridge Analytica obtained data on tens of 
millions of people from Facebook, in circumstances still being investi- 
gated by multiple regulators, and used them to target political advertis- 
ing in the 2016 US presidential election has led to a ‘techlash. There's 
been a US$60-billion fall in the value of the social- 
media giant, and a surge in people searching for 
how to delete their accounts. 

Much behind the outcry has been hiding in 
plain sight. Too many data companies’ business 
models are based on hoovering up our personal 
data and selling them. 

What can be done to restore trustworthiness? 
Social-media companies must do more than say 
sorry and vow to improve protections. They must 
adapt to ensure that data collected are used for the 
common good. 

The techlash snarls together several concerns. 
One is the protection of privacy. Some have argued 
that this requires strengthening the ownership we 
have over our own data, allowing people to select 
or sell levels of data use. This is problematic: it 
assumes ‘data about me’ are data I own. But many personal data are 
created through interactions with other people or services — if I have 
a relationship with somebody, who owns those data? A better question 
is what right to privacy does each of us have? I also doubt that offering 
more options for ownership would bring much change. Relatively few 
people shop around for the cable-television company or energy provider 
with the best rates or services. Why might they be more active with 
data? Finally, such a data-ownership model would increase inequal- 
ity. The well-offand the well-informed would be protected, leaving the 
vulnerable to trade their data away. 

Smart privacy regulation is a better approach to curbing inappropriate 
use of personal data. The European Union is making strides with its new 
General Data Protection Regulation policy, which will come into force in 
May and give EU nations stronger powers to deal with data breaches. In 
this area, the United States could learn from Europe. More widely, infor- 
mation regulation and regulators around the world need strengthening. 
Many policies were set up when the collection and use of personal data 
were backwater issues. In particular, to create and implement the best 
policies, regulators must be able to pay competitive salaries to recruit 


D ata science brings enormous potential for good — for example, 


DATA COMPANIES 
SHOULD WORK TO 


BOOST 


POTENTIAL 


USE OF DATA 


FOR THE 


PUBLIC GOOD. 


Use our personal data for 
the common good 


Technology giants should take lessons from the Human Genome Project and be 
data stewards, not data owners, says Hetan Shah. 


technical talent, or risk losing it to the very giants that need regulating. 

There is more than privacy at stake. Facebook and other social-media 
companies are now information (and misinformation) providers that 
affect our democracies. They need to ensure that their algorithms do not 
promote misinformation as clickbait. It would be in Facebook's interest 
to nourish a system that creates reliable content to fuel its users’ inter- 
actions. Facebook could do much good if it put just 1% ofits profits into 
an independent trust to fund quality media, especially local media, and 
fact checkers. 

Another issue for democracy is microtargeted political advertising. 
Claims that Cambridge Analytica made about its ability to use this tactic 
to change people's minds on political issues were probably overblown. 
Microtargeting is not inherently unethical, but it must be made fully 
transparent. We cannot do democracy in the dark. 

There is also unease that technology compa- 
nies will grow into unchecked data monopolies. It 
would be hard to break the companies up, because 
we would then lose the networked benefits we get 
from them as consumers. But how else might we 
ensure the use of data for the public good rather 
than for purely private gain? 

Here are two proposals towards this goal. 

First, governments should pass legislation to 
allow national statistical offices to gain anonymized 
access to large private-sector data sets under openly 
specified conditions. This provision was part of the 
United Kingdom's Digital Economy Act last year 
and will improve the ability of the UK Office for 
National Statistics to assess the economy and society 
for the public interest. 

My second proposal is inspired by the legacy of John Sulston, who 
died last month. Sulston was known for his success in advocating for 
the Human Genome Project to be openly accessible to the science 
community, while a competitor sought to sequence the genome first 
and keep data proprietary. 

Like Sulston, we should look for ways of making data available for 
the common interest. Intellectual-property rights expire after a fixed 
time period: what if, similarly, technology companies were allowed to 
use the data that they gather only for a limited period, say, five years? 
The data could then revert to a national charitable corporation that 
could provide access to certified researchers, who would both be held 
to account and be subject to scrutiny that ensure the data are used for 
the common good. 

Technology companies would move from being data owners to 
becoming data stewards. m 


Hetan Shah is executive director of the Royal Statistical Society and 
visiting professor at the Policy Institute, King’s College London. 
Twitter: @HetanShah 
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Alinvestment 

France will invest €1.5 billion 
(US$1.8 billion) in artificial- 
intelligence (AI) research and 
innovation by 2022, as part of 
a national strategy unveiled by 
French President Emmanuel 
Macron at a conference in Paris 
on 29 March. INRIA, France’s 
national computer-science 
agency, will coordinate the plan 
with other French research 
agencies and universities. 

The strategy aims to create 
networks of AI research 
institutes in four or five places 
across the country. At the 
conference, Google's London- 
based AI firm DeepMind 

also announced that it will 
establish a centre in Paris, its 
first in continental Europe. 
South Korean electronics giant 
Samsung said that it would 
create a large Al centre in or 
near the capital and employ 
about 100 researchers. Japanese 
firm Fujitsu announced plans 
to make France its European 
centre of AI research. 


Fuel efficiency 

On 2 April, the US 
Environmental Protection 
Agency (EPA) announced 
plans to relax fuel-efficiency 
standards for vehicles 
manufactured between 

2022 and 2025. The current 
standard, which was 

finalized in January 2017 

after a review by former 
president Barack Obama's 
administration, would require 
the average fuel efficiency 

for new passenger cars and 
trucks to be 23.2 kilometres 
per litre by 2025 — 33% higher 
than the goal in 2010. EPA 
chief Scott Pruitt said that the 
standards had been set too 
high. He will also reconsider 
an EPA waiver that allows 
California to set fuel-efficiency 
standards independently of 


The news in brief 


Chinese space lab plummets to Earth 


The defunct Chinese space station Tiangong-1 
re-entered Earth’s atmosphere on 2 April, 
breaking apart over the southern Pacific 
Ocean at around 00:15 utc. The bus-sized 
spacecraft had been in orbit since 2011. Two 
groups of Chinese astronauts lived and worked 


the federal government, and 
which other states may choose 
to follow. The agency will work 
with the National Highway 
Traffic Safety Administration 
to establish a new standard. 


CRISPR crops 


The US Department of 
Agriculture (USDA) will not 
regulate plants produced using 
genome-editing techniques, 
including CRISPR-Cas9, 

the agency announced on 

28 March — as long as they 
could also have been created 
using conventional breeding 
techniques. The engineered 
products cannot be plant pests 
or have been developed using 
plant pests, and they can't 
contain genes from distant 
species. Those plants will 

still be regulated. The policy 
reverses rules proposed under 
former US president Barack 
Obama. It’s unclear whether 
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food produced from gene- 
edited plants will need to be 
labelled as such. 


UK funding body 


Britain's powerful new 
research-funding body 
officially came into existence 
on 1 April. UK Research and 
Innovation (UKRI), born out 
of sweeping higher-education 
and research reforms passed 
last year, brings together the 
country’s seven research- 
funding councils and the 


business-focused Innovate UK. 


Research England, a new body 
that will oversee the research 
and knowledge-exchange 
activities of the now-defunct 
Higher Education Funding 
Council for England, will fall 
under the same umbrella. 
UKRyI has an annual budget of 
£6 billion (US$8.4 billion) and 


on it in 2012 and 2013, using it as a testbed 

for a future, larger space station. But in 2016, 
mission managers lost control of the spacecraft. 
Asa result, it began an uncontrolled descent 

to Earth, although it was closely watched by 
organizations that track space debris. 


is headed by Mark Walport, 
the government's former chief 
scientific adviser. 


Singapore funding 
Singapore's largest national 
research agency announced 
that groups doing basic 
scientific research will need 
to start competing for all 
their funding, as part ofa 
reorganization unveiled on 
27 March. The Agency for 
Science, Technology and 
Research (A*STAR) runs 

18 research institutes; it will 
continue to guarantee core 
funding to those that partner 
with industry or focus on 
technology development. 
Basic scientists at those 
centres will be funded entirely 
through merit-based awards. 
The changes will take effect 
from 1 April, but will not 
affect projects that already 
have funding. 
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ASTRONOMY 


The mirrors of the James Webb Space Telescope are designed to peer at the Universe’s first stars. 


Se ne: © 


NASA reveals major delay 
for Hubble successor 


James Webb Space Telescope woes could have broader effects on astrophysics programme. 


BY ALEXANDRA WITZE 


ASA will delay the launch of its 
Nexto James Webb Space Tele- 

scope (JWST) by nearly a year, until 
approximately May 2020. That is likely to push 
the cost of the mission — the most complex 
space-science telescope ever built — over the 
US$8-billion limit set by the US Congress. It 
is the first major setback since NASA revised 
its plans for the project in 2011, after years of 


slipping schedules and rising costs. 

NASA announced the delay on 27 March, 
saying that engineers needed more time to 
assemble and test the components of the 
spacecraft at its main contractor, Northrop 
Grumman in Redondo Beach, California. 
Among other problems, the collapsible, 
tennis-court-sized sunshield that will protect 
the observatory’s 6.5-metre mirror took weeks 
longer than expected to fold and refold during 
testing. “Frankly, the tests are taking longer to 


complete than expected,” Robert Lightfoot, 
NASAs acting administrator, told reporters. 

The agency did not say how much the delay 
would cost, but some estimates suggest that it 
could add a few hundred million dollars to the 
project. In recent years, Congress has pushed 
NASA to hold down the cost of the telescope 
and other future missions. 

The delay will affect NASA's astrophysics 
budget more broadly, including its next big 
planned space observatory, the Wide-Field > 
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> Infrared Survey Telescope (WFIRST). 
The agency had aimed to spend more on 
WFIRST in the coming years as its contribu- 
tions to the JWST shrank, trading off the end 
of one mission's development for the begin- 
ning of another. Now, the JWST delay risks 
compounding problems for WFIRST. Last 
month, US President Donald Trump proposed 
cancelling WFIRST; astronomers protested, 
and Congress gave the project $150 million. 

US astronomers ranked the JWST and 
WFIRST as the most important large space mis- 
sions, respectively, in decadal surveys of their 
scientific priorities released in 2000 and 2010. 

NASA delayed the JWST launch after an 
independent board of experts concluded this 
month that the project could not meet its June 
2019 launch goal. Last September, the agency 
shifted to that target, abandoning the October 
2018 launch date it had aimed for since reboot- 
ing the project in 2011. 

Now, yet another independent panel — led 
by former aerospace executive Tom Young — 
will review the project’s schedule. NASA will 
use these analyses to pick a more specific 
launch target in the coming months. “We 
have one shot to get this right before going 
into space,” says Thomas Zurbuchen, NASA‘s 
associate administrator for science. 

Until recently, project managers were able to 
deal with schedule problems by moving work 


tasks around locations. For instance, engineers 
decided to clean the telescope’s mirror at 
NASA’ss Johnson Space Center in Houston 
before shipping it to the crowded spacecraft- 
assembly facility at Northrop. 
Northrop staff are now working 24 hours 
a day, but cannot handle enough parts simul- 
taneously to stay on schedule. The increased 
number of workers is adding to the project's 
overall cost, the 


“We have one US Government 
shot to get this Accountability Office 
right before reported last month. 
going into There are other 
space. 4 problems, too. In 


April 2017, a techni- 
cian applied too high a voltage during a test, 
damaging components of the propulsion sys- 
tem that took more than a month to replace. In 
October, engineers discovered several tears in 
the sunshield caused by “workmanship error”. 
And part of the five-layer sunshield snagged 
during a deployment test. 

In a statement, Northrop said that it 
“remains steadfast in its commitment to NASA 
and ensuring successful integration, launch 
and deployment” of the telescope. 

All the testing is crucial because the JWST 
will operate from an orbit about 1.5 mil- 
lion kilometres from Earth, where it cannot 
be serviced by astronauts as the Hubble Space 


Telescope was. The JWST will be 100 times 
more powerful than Hubble, and will survey 
the Universe mainly in infrared wavelengths. 
Among the many celestial phenomena that it 
aims to explore are the first stars and galaxies 
to form in the Universe, as well as planets in 
and beyond the Solar System. 

NASA had asked scientists to submit 
proposals for the JWST’s first set of observa- 
tions by next week, but the agency has can- 
celled that deadline. “Wed rather have it launch 
later and work perfectly than rush and have 
problems,” says Emily Levesque, an astrono- 
mer at the University of Washington in Seattle. 
“But there are going to be a lot of people 
considering what this means for astronomy.’ 

Garth Illingworth, an astronomer at the 
University of California, Santa Cruz, says that 
the next decadal survey, scheduled for 2020, 
should be postponed. “It’s hard to imagine a 
group of people thinking clearly about what to 
do in the future with uncertainty about JWST’s 
performance hanging around, he says. 

NASA will also have to figure out how to 
accommodate the extra costs — perhaps by 
taking them out of the operations budget for 
the JWST, penalizing Northrop or delaying 
WFIRST. The agency has spent $7.3 billion 
on the JWST so far, Lightfoot said, and cannot 
exceed $8 billion without permission from 
Congress. m 


BIOMEDICAL RESEARCH 


Cancer researchers push to 
relax rules for clinical trials 


US government examines whether study criteria unnecessarily exclude some people. 


BY HEIDI LEDFORD 


early 20% of publicly funded cancer 
Nev trials in the United States 
fail because investigators are unable 
to enrol enough participants. Yet patients 
and their physicians often grow frustrated 
when they encounter the sometimes- 
insurmountable requirements to join a study. 
Now, researchers are pruning the lengthy 
lists of eligibility criteria for trials, in the hope 
of nixing unnecessary rules that might be hin- 
dering research. On 16 April, representatives of 
the US Food and Drug Administration (FDA) 
will meet stakeholders in Washington DC to 
discuss how restrictive eligibility criteria for 
clinical trials could be limiting people’s oppor- 
tunities to access experimental treatments — 
and the quality of the data generated by the 
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studies. The agency plans to use the informa- 
tion it gathers to develop guidelines for drug 
makers. 

“You can have the greatest ideas and the 
greatest science,” says Stuart Lichtman, an 
oncologist at Memorial Sloan Kettering Cancer 
Center in New York City. “But if no one goes on 
the study, what good is it?” 

Eligibility requirements are typically 
intended to protect either the participant or the 
study. Participants with some degree of liver 
failure, for example, might not be allowed to 
take part in a trial ofa drug thought to pose a 
risk to that organ. Criteria might also exclude 
people with conditions that could confound 
the results of a study. 

But some researchers say that a ‘cut-and- 
paste’ mentality has increased clinical-trial 
requirements over time, as scientists have 


used previous trial protocols as templates for 
their next studies. That might be needlessly 
restricting participation in trials. 

David Gerber, a lung-cancer specialist at 
the University of Texas Southwestern Medi- 
cal Center in Dallas, and his collaborators 
have found that 80% of clinical trials spon- 
sored by the US National Cancer Institute 
excluded people with previous cancer diagno- 
ses (D. E. Gerber et al. J. Natl Cancer Inst. 106, 
dju302; 2014). Yet in many cases, he says, the 
previous cancer might have been caught early 
and removed successfully before the person 
developed lung cancer. 

“What really frustrates me are instances 
when, in my mind and in my heart, it really 
seemed that the patients should be eligible,” 
says Gerber. “If I had the exact same treat- 
ment outside of a clinical trial, I would give 
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Participants in clinical trials of cancer drugs must often meet a lengthy list of eligibility criteria. 


it to them without a concern” 

A joint project by the FDA, the American 
Society of Clinical Oncology (ASCO) in 
Alexandria, Virginia, and the advocacy group 
Friends of Cancer Research in Washington DC 
has found that five common criteria for 
cancer-trial eligibility could often be amended 
without harming participants or the integrity 
of the trial. The team published its results last 
October (E. S. Kim et al. J. Clin. Oncol. 35, 
3737-3744; 2017). 

People with HIV, for example, were once 
excluded from trials because of their poor 
prognosis. Now, with treatment, they often live 


as long as people without the virus and should 
be included in many cancer trials, the group 
concluded. 

The team also recommended that in some 
cases, researchers should ease restrictions on 
people with organ dysfunction. That could be 
particularly important in light of the ageing 
populations in some countries, including the 
United States, says Lichtman. The restrictions 
were put in place when cancer treatments were 
more broadly toxic, he notes, and might not be 
necessary for the more targeted drugs available 
today. 

One recommendation that could generate 
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some controversy, he says, is a push to lower 
the age of eligibility for many adult cancer 
trials from 18 to 12. This reflects an under- 
standing of basic drug metabolism, says 
Edward Kim, an oncologist at Atrium Health 
in Charlotte, North Carolina, who chaired the 
ASCO effort. “There is nothing magical about 
18, he says. “Your body pharmacologically 
metabolizes drugs the same way at age 12 as 
it does at age 18” 

But some adult-cancer physicians might feel 
uncomfortable treating younger people, and 
often treatment of these individuals takes place 
in specialized children’s hospitals, unlike adult 
clinical trials. Furthermore, most adolescent 
cancers are rare, and they can differ from adult 
cancers — even when they start in the same 
organ. This means the change might have lit- 
tle impact on research overall, says paediatric 
oncologist Peter Adamson of the Children’s 
Hospital of Philadelphia in Pennsylvania. But 
it could still help individual adolescents who 
might otherwise have been excluded from tri- 
als, he adds: “It’s the right thing to do” 

Kim and others are now working to see their 
changes implemented, and have submitted 
their suggestions to an influential programme 
that coordinates clinical development of new 
therapies at the US National Cancer Institute. 
Kim says he has been contacted by research- 
ers at large pharmaceutical companies who are 
eager to make the changes in their upcoming 
trials. 

The result, he says, could be data that are 
more relevant to the people whom he and his 
colleagues treat every day. “These patients have 
these characteristics and they’re going to be 
treated eventually by their doctors,” says Kim. 
“This is the real world.” = 


ASTROPHYSICS 


Dark-matter detector 
in Italy strikes again 


Upgraded experiment sees a beguiling data fluctuation. 


BY DAVIDE CASTELVECCHI 


group of physicists says that it is still 
Aeive signs of dark matter — the 

mystery substance thought to make up 
85% of matter in the Universe — 20 years after 
it saw the first hints of such a signal. 

DAMA, a collaboration of Italian and 
Chinese researchers, has announced long- 
awaited results from six years of data-taking, 
which followed an upgrade to the experi- 
ment in 2010. The findings are a boost for 


the multiple groups attempting to reproduce 
DAMAs results, which have been controver- 
sial and contradict those of other experiments. 
But DAMAs improved sensitivity also makes 
its results harder to explain, physicists say. 
Observations of galaxies and of the Uni- 
verse’s primordial radiation imply that the 
vast majority of matter is of a type that is invis- 
ible and interacts almost exclusively through 
gravity. Many theories exist for explaining 
the nature of this dark matter, and lots of 
experiments have been attempting to detect it 


through its subtle interactions with ordinary 
matter. 

Rita Bernabei, a physicist at the University 
of Rome Tor Vergata who has led DAMA since 
its early days, presented the latest results on 
26 March at a meeting at central Italy’s Gran 
Sasso National Laboratory, where the experi- 
ment sits in a cavern under a mountain. Like 
many detectors, DAMA aims to measure the 
tiny amount of energy given off when atoms of 
ordinary matter on Earth interact with unseen 
particles in a ‘halo of dark matter thought to 
envelop the Milky Way. 

DAMA works by recording flashes of light 
that occur inside crystals of sodium iodide when 
subatomic particles hit the nucleus of a sodium 
or iodine ion. Interactions with dark-matter 
particles should make that signal vary through- 
out the year. That’s because, as the Sun moves 
around the Galaxy, Earth ploughs through the 
dark-matter halo more quickly in some parts 
of its orbit around the Sun than in others. The 
signals should peak in early June and be at their 
lowest in early December, says Katherine > 
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> Freese, a theoretical astroparticle physicist at 
the University of Michigan in Ann Arbor, who 
was part of the team that first proposed looking 
for such a signal, in 1986 (A. K. Drukier et al. 
Phys. Rev. D33, 3495-3508; 1986). 

When DAMA first announced that it had 
seen such a fluctuation in 1997, soon after an 
early version of the experiment was turned on, 
the physics community was sceptical. Critics 
doubted that this effect was a genuine sign 
of dark matter. Instead, they said, terrestrial 
sources or quirks in the apparatus might be 
mimicking a real signal. There was also a pos- 
sibility that the blip would vanish after parts of 
the detector were replaced with newer technol- 
ogy. But that didn’t happen. “The modulation 
is still there, loud and clear,’ says Freese. 

A number of increasingly sophisticated 
experiments that should also see dark mat- 
ter — although using different techniques — 
have so far found none. But the DAMA team 
has continued to see a fluctuation. The group 
confirmed that it had seen the signal in 2013 
(R. Bernabeiet al. Eur. Phys. J. C 73, 2648; 2013), 
with a previous incarnation of the experiment. 
The latest findings from DAMA come as other 
experiments attempt for the first time to cor- 
roborate or disprove the claim using the same 
type of sodium iodide crystal as in DAMA. 

Leading that pack is COSINE-100, a US 
and South Korean experiment at the Yang- 
yang underground laboratory in South 
Korea. Hyunsu Lee, a physicist at the Institute 
for Basic Science in Daejeon, says that had 
DAMaAs signal disappeared in the new data, it 
would have dampened motivation for carrying 
out further sodium iodide experiments. 

“For us, these results are very encourag- 
ing,’ says Susana Cebrian, a physicist at the 
University of Zaragoza in Spain who works on 


\) 


The DAMA experiment in Italy is hunting for signs of dark matter. 


another replication attempt, called ANAIS, 
in the Canfranc Underground Laboratory in 
the Pyrenees. 


UNEXPECTED DEVIATION 

But DAMA’ latest results have a twist. The 
upgrade has made the detector sensitive to 
lower-energy collisions — signals from slower- 
moving particles. For typical dark-matter 
models, the timing of the fluctuations, as seen 
from Earth, should reverse below certain ener- 
gies: “It should peak in December and be at 
a minimum in June,” says Freese. The latest 
results don't show that. 

The deviation “is refreshing, and food for 
thought’, says Juan Collar, an experimental 
physicist at the University of Chicago in Illinois 
who works on dark-matter detection. 

But many physicists still express scepticism. 


Dan Hooper, a physicist at the Fermi National 
Accelerator Laboratory in Batavia, Illinois, 
tweeted on 26 March: “I cannot come up with 
a viable model that can produce this signal.” 

Freese, who isn't part of the DAMA collabo- 
ration, is more sanguine. She says that the data 
at low energies are still tentative, and could yet 
be compatible with a flip. 

“It is more urgent than ever that an 
independent experiment based on the same 
technique, like ANAIS, could reproduce the 
effect,” Cebrian says. Other experiments are 
planned in Australia and Japan. 

Although DAMAs latest upgrades removed 
some potential concerns that the effect might 
have been generated inside the detector, Collar 
says: “The mystery, however, remains of why 
their result is incompatible with just about 
every other finding in this field” = 


S. SCHIAVON/LNGS-INFN 


Copyright reforms draw 


fire from scientists 


Planned changes to EU regulations prompt concerns that they willimpede open science. 


BY QUIRIN SCHIERMEIER 


n influential committee of the Euro- 
A= Parliament is due to vote this 
month on changes to copyright regula- 
tions, but the latest drafts of the rules have trig- 
gered a wave of criticism from open-science 
advocates. They say that the proposals will 
stifle research and scholarly communication. 
Intellectual-property experts agree that 
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existing EU copyright rules need an overhaul for 
the digital age, and a proposal first circulated by 
the European Commission in 2016 had this goal 
in mind. But critics worry that some provisions 
in more-recent proposals for the law — known 
as the directive on copyright in the digital single 
market — conflict with Europe’ principles of 
open science and freedom of expression. 
“Copyright law must not hamper open 
science; says Vanessa Proudman, European 


director of the Scholarly Publishing and Aca- 
demic Resources Coalition (SPARC), a science- 
advocacy group in Apeldoorn, the Netherlands. 
“The EU has made significant headway towards 
open access of research funded by European 
citizens. The proposed new rules would clearly 
impede further progress, threatening the 
visibility of Europe's research,” she says. 
Concerns focus on a provision that would let 
publishers claim royalties for the use of snippets 
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of information, such as tables or headlines. 
This was included with the aim of enabling 
news publishers to secure revenue from social- 
media platforms such as Facebook and Google. 
But a proposal added by a European-parliament 
committee would mean that the provision also 
applies to academic publications. 

Many scholarly publishers, including the 
International Association for Scientific, Tech- 
nical and Medical Publishers (STM), based 
in Oxford, UK, support this amendment. But 
open-research advocates say that facts and 
information in a scientific article must remain 
free from copyright. “We really don’t want fur- 
ther paywalls on top of any research materi- 
als libraries have paid for already,’ says Maria 
Rehbinder, a copyright specialist in Aaalto, 
Finland, with the Association of European 
Research Libraries. 


FEE CONTROVERSY 
Some researchers express concern that the 
proposed rule might even force scientists 
to pay fees to publishers for references 
they include in their own publications. But 
STM “cannot envisage any situation where 
students and researchers would need to 
pay fees” for citations, says Matt McKay, a 
spokesperson for STM. 

The EU copyright law, as written, would also 
compel research repositories to prevent uploads 


of copyrighted papers and other content. 
Currently, the onus is on academic publishers 
to issue take-down notices for papers illegally 
posted to repositories. 

The scholarly social network ResearchGate, 
for example, has in recent months disabled 
public access to more than 1.7 million papers 
on its site, in compliance with take-down mes- 
sages by publishers. This process of removing 

articles upon request, 


“We really don’t says Proudman, 
want further works well and effec- 
paywalls ontop __ tively for institutional 
of any research repositories. Forcing 
materials all existing non- 
libraries profit educational 
have paid for and research-data 
already.” services, including 


more than 1,000 uni- 
versity repositories, to seek copyright licences 
and install upload filters would overburden 
most institutions, she says. “The proposed level 
of surveillance would put science repositories 
in the same boat as Facebook or YouTube,” she 
says, by requiring them to scan submissions for 
possible copyright violations. 

The proposed rules arent all bad news for 
science, says Marie Timmermann, who is 
in charge of EU legislation and regulatory 
affairs at Science Europe, an association of 
national research-funding agencies in Brussels. 
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Text-mining — in which researchers use com- 
puter programs to extract data automatically 
from large numbers of texts — is exempted from 
the copyright law, when carried out in the public 
interest. Scientists at public research organiza- 
tions would be allowed to harvest facts and data 
from all sources they have legal access to read. 

However, this exemption does not extend to 
companies — a possible problem for EU-funded 
research projects, which increasingly include 
commercial partners, Timmermann notes. 

The European Parliament legal committee's 
vote on the law, scheduled for 23-24 April, will 
bea crucial test of whether lawmakers are listen- 
ing to scientists’ concerns. The precise version 
the committee will consider has not yet been 
finalized and circulated, and the final law will 
also need to be approved by the entire parlia- 
ment and by EU member states before it can 
come into effect, due for next year. “For the sake 
of European research, we hope the worst flaws 
will yet be deleted,” Timmerman says. m 


CORRECTION 

The Editorial ‘Al diagnostics need attention’ 

(Nature 555, 285; 2018) gave an inaccurate 
description of the methods in a 2017 study. 
The model detected breast cancer in whole 

slide images, not mammograms. 
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CLARIFICATION 

The News story ‘Copyright reforms draw 
fire from scientists’ (Nature 556, 14-15; 
2018) should have made it clear that when 
Vanessa Proudman talked of “that process” 
she was referring to how institutional 
repositories deal with copyright violations. 
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Send in the germs 


Lab mice are usually kept squeaky clean, but some immunologists 
think a dose of dirt could make them more useful for science. 


BY CASSANDRA nan unseasonably warm February morning, Mark Pierson 
WILLYARD takes a 20-minute drive to one of Minneapolis’s larger pet 
shops. Pierson, a researcher in an immunology laboratory at 
the University of Minnesota, often comes here to buy mice, 
so most of the staff know him. Today he asks for ten, and an employee 
fishes them out ofa glass box. Pierson requests the smaller mice because 
they're typically younger, but he isn’t too picky. They probably all have 
what he wants: germs. 

These mice are about to enter one of the most tightly controlled labs 
in the country, a facility normally reserved for studying dangerous 
pathogens such as tuberculosis and chikungunya virus. The rodents 
probably don’t carry serious human infections, but they do harbour 
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diseases that pose a grave threat to the hundreds of other research 
mice in the building. 

The pet-shop mice are about to get new room-mates. Each one will 
bunk with a group of shiny black lab mice, sharing food, water, bed- 
ding and, most importantly, pathogens. Until now, the lab mice have 
been kept in a squeaky clean environment, free from most diseases, 
so some will fall ill and die. The rest will develop more robust immune 
systems, more like those of wild mice — and, arguably, humans. 

What Pierson is doing breaks the rules. For more than 50 years, 
scientists have worked to make lab mice cleaner. In most labs today, 
the animals’ cages are sanitized, and their water bottles and food are 
sterilized. “We really go to great lengths to keep natural infectious 
experience out of the mouse house,” says David Masopust, an immu- 
nologist at the University of Minnesota who heads the lab where 
Pierson works. Those efforts have paid off: with the confounding 
effects of pathogens controlled, mouse experiments have become 
less variable. 

But a raft of studies now suggests that this cleanliness has come 
at a cost, leaving the rodents with stunted immune systems. In a 
quest for standardized and spotless mice, scientists have made the 
creatures a less-faithful model for human immune systems, which 
develop in a world teeming with microbes. And that could have 
serious implications for researchers working to usher treatments 
and vaccines out of the lab and into the clinic. Although it’s not 
yet possible to pin specific failures on the impeccable hygiene of 
standard mouse models, Masopust thinks the artificial environment 
must have some effect. It’s no secret that the success rate for moving 
therapies from animal to humans is abysmal — according to one 
estimate’, 90% of drugs that enter clinical trials fail. “You have to 
wonder if you might sometimes get misinformed simply because 
youre in a clean environment,’ says Masopust. 


“Ts this a mouse issue, 
really just a lab-mouse issue?” 


That's why he and other researchers are developing dirtier models 
that better replicate how the immune system develops in the natu- 
ral world. Some groups have given their mice infections”’, others a 
more natural microbiome*”. But housing the dirtier mice can be risky. 
Pet-shop mice carry so many infections, it’s as if they came from “a 
Dickensian orphanage’, says Aaron Ericsson, a microbiome researcher 
at the University of Missouri in Columbia. Lab-animal caretakers take 
biosecurity very seriously and mice are a precious resource. “The last 
thing you'd want to do is have some sort of an outbreak” 


DISH THE DIRT 

Masopust began thinking about the cleanliness problem more than 
a decade ago. He was struck by how much the immune make-up of 
lab mice differs from that of humans. At the time, many researchers 
blamed the differences on genetics, but Masopust suspected that lab 
mice are different in part because of where they live. “Is this a mouse 
issue,’ he wondered, “or is this really just a lab-mouse issue?” 

To answer that, Masopust started comparing the immune systems 
of lab mice to those of mice he had trapped in barns and bought from 
pet shops. Lab mice had many fewer cancer- and infection-fighting 
memory T cells — immune cells that have previously been exposed 
to pathogens — in their blood. They were also almost entirely lack- 
ing T cells in other tissues in the body. Humans, wild mice and pet- 
shop mice are swarming with these tissue-resident memory T cells. 
Overall, the lab mice’s immune systems looked less experienced, 
more like that of a human infant than that of an adult. 

Masopust suspected that past infections played an important 
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part. If so, he thought he might be able to induce changes in the lab 
mice’s immune systems by exposing them to infectious agents. If the 
lab-mouse problem was cleanliness, could he make them dirtier? 

He devised a seemingly simple experiment: he would drop a pet- 
shop mouse into a cage with several lab mice. The lab mice would pick 
up whatever the pet-shop mouse was carrying — everything from 
fur mites and pinworms to mouse hepatitis — and perhaps become 
immunologically more like the pet-shop mouse. This co-housing 
approach would let the researchers “take our cherished well-defined 
inbred strains and push them closer to the kind of normal immune 
experience that a human would have’, says Stephen Jameson, a Uni- 
versity of Minnesota immunologist who collaborates with Masopust. 

But there was one major hurdle: the researchers had nowhere to 
put the germ-ridden rodents. “The last thing I want to do is con- 
taminate my colleagues’ mouse colonies,” Masopust says. When 
he first discussed the experiment with the animal-resources staff, 
“jt definitely induced heart palpitations’, he says. But in a stroke of 
good luck, the university was about to construct a high-containment 
laboratory in Masopust’s own building. The facility was designed 
for biosafety-level-three (BSL-3) research, meaning that it would 
securely contain pathogens that can infect humans. But it would also 
prevent mouse pathogens from spreading to other mice. In 2013, 
Masopust and his colleagues managed to secure a room there. “I was 
lucky,’ he says. “It was under-utilized. They needed revenue. That 
helped them be open-minded. Today, that room houses 500 mice 
in plastic cages, each one containing a handful of sleek lab mice and 
one scrappy pet-shop mouse. 

After a month bunking with the pet-shop mice, the newly dirty 
lab mice had many of the same immunological features as wild and 
pet-shop mice’. They had more differentiated memory T cells than 
normal lab mice, and they developed tissue-resident memory T cells. 
The standard lab mice looked immunologi- 
cally similar to newborn babies in terms 
of which of their genes were more or less 
active, but the gene-activity profiles of 
pet-shop and co-housed mice were closer 
to those of adult humans. The dirty mice 
also mounted much greater resistance than 
clean mice when the researchers infected 
them with the bacterium Listeria monocytogenes: three days after 
infection, the number of bacteria they were carrying fell by more 
than four orders of magnitude, a response comparable to that of lab 
mice that have been vaccinated against the bacterium. 

Soon after Masopust began work in the BSL-3 lab, Herbert Virgin, 
an immunologist at Washington University in St. Louis, Missouri, 
and his colleagues independently embarked on a similar project to 
understand the immune systems of lab mice. But rather than using 
pet-shop mice to transmit infections, they decided to transmit the 
infections themselves, an approach that offered more control than 
co-housing. “As somebody who also has trained as a virologist, I 
like to know what the pathogen is, going in,” says Tiffany Reese, a 
member of Virgin’s lab at the time, and now a viral immunologist 
at the University of Texas Southwestern Medical Center in Dallas. 

They selected four pathogens: two types of herpesvirus, one influ- 
enza Virus and an intestinal worm called a helminth that chronically 
infects the small intestines of mice. The pathogens were all simi- 
lar to those that often infect children in developing countries. The 
researchers gave the mice the infections one at a time and allowed 
the animals time to recover before administering the next infection 
— in much the same way as humans get an infection, recover, then 
get another. Another group of mice received mock inoculations with 
saline. The final immune challenge was a vaccination against yellow 
fever, which uses a live but weakened form of the virus. 

Like Masopust’s group, the researchers noticed significant changes 
in the sequentially infected mice’. They differed in their gene-expres- 
sion profiles and in their response to the vaccination: at first, both 
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groups had the same antibody responses, but a month later, the co- 
infected mice had lower antibody levels. It’s not clear yet whether 
this difference affected how well the vaccine worked. “I think the 
jury is out about whether it has any specific utility,’ Virgin says. Still, 
he hopes that these dirtier models will lead to greater mechanistic 
understanding of the immune system. 


CALL OF THE WILD 

Other researchers have bypassed the pet shop in their quest for dirty 
mice. Immunologist Stephan Rosshart at the US National Institute of 
Diabetes and Digestive and Kidney Diseases (NIDDKD) in Bethesda, 
Maryland, has driven hundreds of kilometres, visiting horse barns 
throughout that state and the District of Columbia to collect wild mice. 

Rosshart had joined the lab of NIDDKD immunologist Barbara 
Rehermann in 2013, and the two began poring over the literature on 
the microbiome, the collection of microorganisms that live on and 
in a larger organism. The studies showed that the microbiome has a 
huge influence on the immune system, but most of the papers they 
found were based on a comparison of two types of lab mouse: some 
with a lab-derived microbiome and others with no microbiome at 
all. What would happen, Rosshart wondered, if he gave a lab mouse 
a wild microbiome? That would preserve the mouse’s genetic back- 
ground but push its physiology closer to that of its wild cousins. 

Rosshart had specific requirements for his wild-microbiome donor: 
he wanted an adult, genetically similar to a lab mouse and free of path- 
ogens so it didn’t risk infecting other mice at the US National Institutes 
of Health (NIH). “I tried to convince Stephan that that’s a very bad 
research idea because it's very difficult,” says Rehermann. But Rosshart 
could not be dissuaded. So each morning, he drove to between 3 and 
10 barns, emptied more than 100 mouse traps and drove back to NIH 
with the mice. He then dissected them and preserved their tissue and 
faeces. In the evening, he retraced his route, collecting even more mice 
and baiting new traps with peanut butter. His days began at 4:30 a.m. 
and ended around midnight. He followed this routine seven days a 
week for two months. “When you do this for, like, one week it’s fun, 
but after a while it gets very challenging,” he says. 

By the end, Rosshart had handled more than 800 mice. He and 
his colleagues selected three with the right genetics and no sign of 
pathogens. They transferred microbes from the animals’ faeces to 
pregnant germ-free mice. When those mice gave birth, they passed 
this microbiome to their pups. The team compared this group with 
germ-free mice that had a microbiome derived from the sanitized 
lab environment. 

Then they infected the mice with a mouse-adapted flu virus; 92% 
of the wild-microbiome mice survived, compared with just 17% of 
mice with the standard lab microbiome’. The wild-microbiome mice 
also developed less-severe disease when the researchers exposed 
them to chemicals that cause colon cancer. “The provocative hypoth- 
esis is that if you make a mouse more like a real mouse in the natural 
world, this becomes a better model for humans who also live in the 
natural world,” Rehermann says. 

More wildness doesn't always lead to greater infection-fighting 
power, however. Last month, Andrea Graham, an evolutionary ecolo- 
gist at Princeton University in New Jersey, and her colleagues showed 
that letting lab mice re-wild themselves makes them more susceptible 
to worm infections*. Graham gave her lab mice free run of eight out- 
door enclosures. When she released the first batch, they immediately 
began exploring the enclosure, digging burrows and sampling new 
food. “They were blissed out. They pulled a couple of all-nighters,’ she 
says. The microbes they encountered significantly affected the mice’s 
ability to control some types of parasite. Mice in Graham's lab tend to 
clear parasitic infections rather quickly. But outdoors, “within a couple 
of weeks they had huge worm burdens’, she says. The researchers are 
still trying to unpack why that might be, which could help to reveal 
how the immune system works in a more natural environment. Per- 
haps the system prioritizes fighting deadly microbes — viruses and 
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bacteria — over less-fatal infections such as worms, says Rosshart. 
“The immune response cannot be perfect against everything,” he adds. 

The dirty models have generated a great deal of excitement. “In 
many ways, they are landmark studies,’ says Alexander Maue, head 
of microbiome products and services at Taconic Biosciences, a 
breeder and supplier of lab animals based in Rensselaer, New York. 
These dirty mice, he says, will allow researchers “to look at different 
mechanisms of protective immunity that you wouldn't find in the 
normal mouse model”. 


MODELS FOR THE MASSES 

But researchers don’t yet know which models will work best for 
which research questions. In Masopust’s version, for example, each 
group of lab mice gets a different cocktail of pathogens. That’s both 
a curse anda blessing, Masopust says, because humans are variable, 
too. In Virgin's design, the mice get a defined set of pathogens, but 
the impact on the immune system isn’t quite so robust. 

Eleanor Riley, an immunologist at the University of Edinburgh, UK, 
says none of these models can fully replicate what happens in nature’. 
Wild mice differ from lab mice in many ways: diet could play a part, 
or sex, daylight or temperature. “I think we need to work more with 
ecologists and zoologists and look at the real world,’ she says. “There 
is a danger of taking a slightly reductionist, simplistic approach.” 

Even recreating such a simplistic version of the wild in a lab is a 
headache, says Virgin. “I don't think people have any question that 
this is important, but actually doing the experiments requires a lot 
of infrastructure.” The wild-microbiome model gets around many of 
the problems of working with pathogens, but as Rosshart well knows, 
catching wild mice comes with its own challenges. 

Whether dirty mouse models represent the human condition better 
than standard lab mice — and provide a better testing ground for 
drugs — also remains to be seen. The ideal experiment would involve 
taking a therapy that failed in clinical trials and retesting it in the new 
models to see whether the results match what happened in humans. 

That's exactly what Masopust’s group is doing, working with two 
drug companies. One has a therapy that failed in human studies, and 
the company would like to know whether the dirty mice could have 
predicted that failure. Another asked Masopust to use his mice to test 
a candidate therapy that works well in clean mice. The preliminary 
data suggest that it does not have much of an effect in dirty mice. 

Colonies of dirty mice are springing up in other places. Daniel 
Campbell, an immunologist at the Benaroya Research Institute in 
Seattle, Washington, received a grant from the NIH last December 
to set up his own collection. He and his colleagues want to test treat- 
ments they have developed for autoimmunity, in which the immune 
system starts attacking healthy tissues. Therapies for such conditions 
seem to work well in pathogen-free mice. But “a lot of those have 
not translated real well into humans’, he says. Campbell thinks dirty 
mice, which have a more developed immune system than standard 
lab mice, might be a more realistic model in which to test those 
therapies. For example, they might allow researchers to better detect 
unwanted side effects. “The concern is safety,’ he says. 

Campbell says that getting the co-housing model up and running 
has been challenging, but he thinks the results will be worth the 
trouble. And many of his colleagues have questions that theyd like 
to test on dirty mice once the colony is ready. “I think there's a lot of 
interest,” he says. “I think they'll all wantin” m 


Cassandra Willyard is a freelance science journalist based in 
Madison, Wisconsin. 
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To reduce people’s consumption of added sugar, various governments have introduced a tax on sweetened drinks. 


Reward food companies for 
improving nutrition 


Governments must provide incentives for businesses to fix the global food system, 
not just punish them for acting irresponsibly, argues Lawrence Haddad. 


his month, the UK government 
"Tine the Soft Drinks Industry 
Levy. Producers, packagers and 
importers of beverages that contain 5 grams 
of sugar or more per 100 millilitres will have 
to pay a tax. Some might increase their prices 
to cover the cost, which could discourage 
buyers. The hope is that most firms will 
make their products less sweet to avoid it. 
More than 20 countries now apply some 
variant of a ‘sugar tax. Various studies show 
that it can reduce people's consumption of 
added sugar”. After Mexico’s government 


introduced a tax on sweetened drinks in 
2014, for instance, sales in 2015 fell by nearly 
10% (ref. 3). 

Such ‘sticks’ policies that punish food 
and drink companies for harming people’s 
nutrition, are popular with governments, 
United Nations agencies, non-governmen- 
tal organizations and others. But in my view, 
a fundamental — and often justified — dis- 
trust of industry means that those trying 
to fix food systems are missing opportuni- 
ties to encourage private-sector businesses 
to do more good things for nutrition, not 


just fewer bad things. We should use ‘policy 
carrots’ too. 


HISTORY OF DISTRUST 
Over the past few decades, many of those 
who work to improve people’s nutrition 
have seen businesses as part of the problem, 
not as an essential part of the solution. Much 
of the wariness stems from how companies 
have promoted breast-milk substitutes and 
sugary drinks. 

Since 1981, the International Code of 
Marketing of Breast-milk Substitutes > 
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> has sought to protect the exclusive 
breastfeeding of infants younger than six 
months, and to position it as a complement 
to other foods for older infants. Adopted by 
the World Health Assembly, the decision- 
making body of the World Health Organi- 
zation (WHO), the code aims to shield 
mothers, health workers and health-care 
systems from commercial promotion that 
undermines breastfeeding. 

Yet producers in some countries often 
violate this code, for example by encourag- 
ing health facilities to include formula milk in 
the packs given to new mothers or by offering 
it free or discounted to pregnant women’. 

The marketing and lobbying techniques 
used by some producers of sugary drinks 
to target children are similarly scandalous. 
Examples include branding educational 
materials, embedding advertisements for 
unhealthy food in computer games, or 
using toys to market such foods to children 
in restaurants’. Such drinks significantly 
increase people's risk of developing type 2 
diabetes, heart disease and other chronic 
conditions. Some producers also refuse to 
take at least some responsibility for the rise 
in obesity throughout Latin America, Africa 
and Asia — a trend that correlates with an 
upswing in the consumption of soft drinks in 
these regions over the past 15 years. 

These flashpoints in nutrition prob- 
ably explain why policy carrots are rarely 
deployed, despite numerous studies indi- 
cating their potential value®. Of the coun- 
tries that informed the WHO about their 
fiscal policies to promote healthy diets in 
2016-17, more than half had increased taxes 
on unhealthy foods and beverages. Less than 
one-quarter had introduced subsidies to 
lower the cost of healthier alternatives’. 

But businesses are the main inves- 
tors in the world’s food systems. In 2016, 
Hershey and General Mills each spent 
more than US$500 million on advertis- 
ing alone (see go.nature.com/2u3jttr). In 
2014, international aid donors spent just 
$50 million in total on combating diet- 
related chronic disease* (see “Top investors’). 

Punitive policies, government guidelines 
on eating healthily and legislation for food 
safety won't be enough to alter food systems 
such that more people are better nourished. 
Governments must also give incentives to the 
main investors in such systems so that they 
playa much more positive part in improving 
nutrition. 


WORKING TOGETHER 

For 25 years, I worked solely in the public 
sector. I now direct a non-governmental 
organization that supports public-private 
approaches to promote the availability, 
affordability and desirability of nutritious 
food — the Global Alliance for Improved 
Nutrition (GAIN) in Geneva, Switzerland. 
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Just 18 months in the job has convinced me 
that many in the private sector are willing to 
adjust their businesses to make money and 
improve people's nutrition at the same time. 

Some heads of companies have their own 
reasons, such as diet-related chronic disease 
in their family. For other companies, dedi- 
cating resources to causes such as nutrition 
can pull in talented and driven employees. 
Overall, I have been struck by the commit- 
ment, knowledge and integrity of many in 
this newly discovered private-sector ‘tribe’ 

Public-private collaborations could 
improve nutrition in many ways. The top 
10 multinationals produce more than 50% 
of all soft drinks. But the top 10 ‘packaged 
food’ companies — which supply branded 
products sold in shops and supermarkets — 
account for only 15% of these sales in the 
world’. So those in the public sector should 
work with small, medium and large national 
companies, not just the vast multinationals. 
Partnerships could even involve companies 
that arent in the food sector. 

Mobile-phone companies, for instance, 
can send people government-approved 
text messages about how to eat healthily, or 
spread links to information about healthy 
diets. By providing this public service, they 
attract more customers”. Such an approach 
is being tried in Tanzania, Bangladesh and 
Ghana with the coordination of GSMA, the 
membership organization for more than 
300 mobile-phone providers. 

Likewise, marketing and advertising com- 
panies can help those in the public sector to 
improve the ‘stickiness’ of media messaging 
around nutrition. As an example, in 2014, 


TOP INVESTORS 


Globally, companies have much deeper pockets 
for food promotion than have governments and 
non-governmental organizations. 


Advertising spend of just 2 of the top 25 
US food and beverage companies 
(Hershey and General Mills) in 2016. 


US$50 


MILLION 


International aid invested in 
nutrition-related diseases in 2014. 


the Indonesian government collaborated 
with a creative agency in Indonesia, GAIN 
and the London School of Hygiene and 
Tropical Medicine. The result was a one- 
minute video during which a mother gos- 
sips about how everyone else is failing to feed 
their children properly. The use of humour 
and emotion seemed to work, in contrast to 
standard government-produced instruc- 
tions about what people should be eating. 
An independent evaluation of the cam- 
paign indicated that it helped 50% of the 
6- to 23-month-old infants in the assessed 
villages to meet a nutrient adequacy thresh- 
old, compared with 36% of infants in the 
control villages”. 

Companies that specialize in food trans- 
port or packaging can help to reduce food 
loss during storage and distribution using 
relatively low-cost technologies, such as 
repurposed storage containers or cheap 
insulating materials. (Perishable foods such 
as fruit tend to be higher in micronutrients 
than longer lasting ones such as cereals.) Last 
year, led by the Lagos state government, we at 
GAIN connected tomato farmers in Nigeria 
with commercial suppliers of reusable plas- 
tic crates. Studies in Asia have shown that 
such crates, used in place of wicker baskets, 
can reduce the loss of fruits and vegetables 
along the supply chain from 30-50% to 5% 
(ref. 12). 

With a mix of public- and private-sector 
technical and financial assistance, small- and 
medium-sized businesses in, say, horticul- 
ture and aquaculture could make their prod- 
ucts more available, affordable, desirable and 
profitable. Since 2013, GAIN has been work- 
ing with around 500 such firms to get more 
servings of nutritious foods (such as beans, 
fish, peanuts and chicken) into markets in 
five countries in Africa and Asia, and to 
make those servings cheaper. Independent 
evaluations show some achievements. For 
example, one firm in Kenya has helped to 
make tilapia fish affordable for 68% of the 
population (up from 49%) in the region 
where it is operating”. 


FIVE STEPS TO BETTER FOOD 

So how do governments and others that are 
striving to improve nutrition identify and 
seize opportunities to change behaviours? 
Five things need to be done. 


Support businesses that work with nutri- 
tious foods. Governments frequently create 
export-processing zones or business parks 
with reduced rents or tariffs for exported 
goods, say, to promote business types that 
boost economic growth. So why not cre- 
ate business parks for producers of nutri- 
tious foods, with lower rents and taxes and 
cheaper electricity and water supplies? 
Most of the small- and medium-sized 
businesses aiming to become the next 
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Swapping wicker baskets for crates provided by packaging firms better protects tomatoes in transit. 


food giants find it hard to access financial 
services. These companies are too high-risk 
to attract investment from banks and tend to 
be ineligible for microfinance schemes that 
provide small loans (around $1-10) to very 
poor households. Governments could create 
financial instruments, such as low-interest 
loans for nutritious-food suppliers, to meet 
their needs. 

Governments could also develop effec- 
tive ‘quality seals’ targeted to specific groups, 
such as street-food vendors or institutional 
caterers, to certify that a food is healthy (or 
unhealthy). Assessments suggest that ‘traf- 
fic-light systems’ of red, amber and green 
ratings used in the United Kingdom and 
Australia, or the black stop-sign labels used 
in Chile, seem to be effective ways to steer 
people towards better nutrition™. 


Create demand for healthy foods. Busi- 
ness leaders often tell me that if consum- 
ers wanted more nutritious foods, they 
would meet that demand. Yet businesses 
shape demand, and some bend it towards 
unhealthy foods — mainly because these 
are easy to produce at scale and to transport, 
market and sell at a significant mark-up. 
Governments must take the lead when 
it comes to building consumer demand for 
healthy foods — much as they have changed 
people's behaviour around smoking and 
drink-driving. That means partnering with 
non-profit foundations and creative agencies 
to make health messages about food accurate 
and memorable instead of worthy and dull. 
In the United Kingdom, the Food Founda- 
tion, a non-profit organization working to 
improve nutrition, collaborated with the 
creative agency ifour in late 2017 to create 
messages and images that tap into children’s 
interest in superheroes to encourage them 


to eat vegetables. The impact of this has not 
yet been rigorously evaluated, but a related 
initiative targeting households in a province 
of Ecuador in 2015 increased egg consump- 
tion to one a day in 6- to 9-month-olds, and 
improved growth”. Providing such incen- 
tives will boost consumer demand and thus 
encourage companies to meet it. 


Create models to emulate. Both govern- 
ments and businesses need evaluated exam- 
ples of things they can do together that work. 
Much of the evidence for the effectiveness 
of public-private partnerships comes from 
other sectors, such as health, infrastruc- 
ture and education; from the unpublished 
reports of public and private organizations; 
or from the minds 


“Governments of those involved. 
canmakeithard 4 2016 literature 
for businesses to review concluded 
do good things that “there are few 


for nutrition.” independent, rig- 
orous assessments 
of the impact of commercial sector engage- 
ment in nutrition”.’° 

UN agencies, non-governmental organiza- 
tions, businesses, researchers and nutrition 
champions need to do a better job of dissem- 
inating the lessons learned, for instance by 
creating a knowledge repository. This could 
be similar to the World Bank's free-to-access 
website on public-private partnerships for 
the building of roads, ports and other infra- 
structure (see go.nature.com/2ptdgqm). 


Name and fame — or shame. Businesses 
can derail public-health initiatives and dis- 
tort publicly available research to suit their 
own ends. For instance, a 2015 investiga- 
tion by the British Medical Journal found 
that researchers at UK advisory bodies had 


received funding from major soft-drinks 
companies’. How exactly this affects 
research is not yet known, but it clearly 
undermines trust. 

Likewise, governments can make it hard 
for businesses to do good things for nutri- 
tion — either through a lack of awareness or 
through poor planning. For example, some 
governments impose a tariff on imported 
premix, the micronutrient-rich compound 
that is used in small amounts to fortify staple 
foods, such as wheat or maize (corn). The 
tariff can dissuade food processors from 
implementing this cost-effective public- 
health strategy. 

A ranking scheme is needed to flag which 
governments and businesses are doing posi- 
tive or harmful things for nutrition. 

One game changer has been the Access to 
Nutrition Index (wwwaccesstonutrition.org). 
Released every three years, the index uses 
mainly self-reported information to evalu- 
ate the world’s 22 largest multinational food 
and beverage manufacturers on their poli- 
cies, practices and performance in relation to 
under-nutrition and obesity. Although it has 
begun to produce national reports, further 
national, independent and evidence-based 
assessments are needed. 

The World Bank currently ranks 
190 national economies according to how 
easy it is to start and operate a firm in that 
country. In principle, governments could 
similarly be ranked according to how easy 
they make it for businesses to produce avail- 
able, affordable and desirable nutritious 
food that can, for instance, reduce the pro- 
portion of women experiencing anaemia or 
the percentage of children who are obese. 
Such assessments (perhaps conducted 
jointly by the World Bank, the WHO and 
the Food and Agriculture Organization of 
the UN) would help to reveal what kinds of 
government action actually help businesses 
to improve nutrition. 


Foster public-private engagement. More 
dialogue between people working on nutri- 
tion in the public and private sectors will 
catalyse all these other steps. 

Major differences in culture, language and 
networks exist between those concerned 
with food systems in the public and pri- 
vate sectors. In fact, before I joined GAIN, I 
talked and worked only with academics, pro- 
gramme implementers and policymakers. 

The accountability measures I describe 
could help those in the public sector to 
decide who to partner with. Also, thorough 
and pragmatic conflict-of-interest guidelines 
will help to reveal when public-health goals 
are at risk. 

There are numerous ways to foster more 
dialogue. Panels at conferences should 
include participants from both sectors. 
Public funders could incentivize joint 
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proposals for nutrition research from 
public-private collaborations. Com- 
panies and public-sector organizations 
could set up staff exchange programmes. 
And executive-level courses, either at 
universities or in private institutions, 
could bring together professionals from 
both sectors to learn from instructors 
drawn from these two worlds. 

Many analysts (myself included, in 
the past) have drawn parallels between 
‘big tobacco and ‘big food: In both cases, 
major corporations wield immense 
power over consumers and society, 
and their products are capable of doing 
considerable harm. 

But there are crucial differences. Unlike 
big tobacco, big food is not the only player. 
There are small- and medium-sized com- 
panies too. And big tobacco cannot make 
tobacco that promotes public health, 
whereas big food can and does produce 
nutritious, sustainable foods. Motivated 
by both carrots and sticks, the industry 
can produce more — at a lower price. m 


Lawrence Haddad is executive director of 
the Global Alliance for Improved Nutrition 
(GAIN) in Geneva, Switzerland. 

e-mail: |haddad@gainhealth. org 
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Developing countries 
must lead on solar 
geoengineering research 


The nations that are most vulnerable to climate change 
must drive discussions of modelling, ethics and 
governance, argue A. Atiq Rahman and colleagues. 
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eople in the global south are on the 
Pp front line of climate change. As global 

temperatures creep upwards, the 
Intergovernmental Panel on Climate Change 
(IPCC) is forecasting rising seas eroding 
small island states’, declining food produc- 
tion in many regions of Asia’, water stress 
across Africa® and major loss of biodiversity 
in South America’. 

Developing countries have spoken out on 
climate policy. Links between climate justice 
and development are now accepted, as is the 
idea that nations have common respon- 
sibilities — emitters are liable for impacts 


A group of villagers stands beside 
the Jamuna River in Bangladesh, 
where erosion is eating into the 


felt elsewhere. Despite having emitted very 
little greenhouse gas themselves, the world’s 
least-developed countries and small-island 
states demanded that the 2015 Paris climate 
agreement require warming to be kept “well 
below” 2°C, and that a 1.5°C limit should 
also be explored. 

But there is a limit to what populations 
threatened by sea-level rise, biodiversity loss, 
droughts and hurricanes can do. Mitigation of 
climate change is crucial. The emissions cuts 
agreed in Paris are not enough — they will 
take the world to a 3°C rise (see go.nature. 
com/2u3ybkh). Adaptation is therefore 


essential. As the scale of the damage grows, 
more countries will turn to the “loss and 
damage” provisions in the Paris agreement. 
And these are vague: who should pay how 
much, and to whom, for lost farming or fish- 
ing livelihoods? What size of cheque would 
compensate for the destruction of coral reefs? 

In that context, solar geoengineering 
— injecting aerosol particles into the strato- 
sphere to reflect away a little inbound sun- 
light — is being discussed as a way to cool the 
planet, fast. The technique is controversial, 
and rightly so. It is too early to know what 
its effects would be: it could be very helpful 
or very harmful. Developing countries have 
most to gain or lose. In our view, they must 
maintain their climate leadership and play 
a central part in research and discussions 
around solar geoengineering. 


HIGH STAKES 
Solar geoengineering is outlandish and 
unsettling. It invokes technologies that are 
redolent of science fiction — jets lacing 
the stratosphere with sunlight-blocking 
particles, and fleets of ships spraying sea- 
water into low-lying clouds to make them 
whiter and brighter to reflect sunlight. Yet, 
if such approaches 


“Thereis a could be realized 
limit to what technically and 
populations politically, they 
threatened by could slow, stop or 
sea-level rise, even reverse the rise 
biodiversity in global tempera- 
loss, droughts tures within one or 
andhurricanes _ twoyears. No other 
cando.” way of doing this 


has been conceived. 
Removing greenhouse gases from the air 
would take decades, if it is even possible. 

A decade of modelling research indicates 
that solar geoengineering might reduce 
many of the worst effects of climate change 
if deployed in moderation. For example, 
injecting 5 megatonnes of sulfur dioxide 
into the stratosphere — about one-quarter of 
that released by Mount Pinatubos eruption 
in 1991 — each year could keep warming 
below 2 °C. (However, there are likely to be 
limits to how much cooling can be achieved, 
especially under high greenhouse-gas 
emissions scenarios’.) Studies have found 
that solar geoengineering should also be 
able to reduce climate impacts on hydrology, 
redressing trends in which wet regions get 
wetter and dry regions get drier®. Lower tem- 
peratures would slow global sea-level rise’ 
and could curb the increasing incidence and 
strength of tropical cyclones*. 

A decade ago, there were serious concerns 
that solar geoengineering might produce 
stark winners and losers and might disrupt 
the monsoons. Research has allayed these 
worries. For example, it seems conceivable 
that moderate solar geoengineering would 
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benefit many regions that are vulnerable to 
climate change, with few losers. Monsoon 
rains would be affected less than if climate 
change proceeds unchecked”. 

But solar geoengineering is no panacea; 
it could compound some risks of climate 
change. It would only mask the warming 
effect of greenhouse gases. Ocean acidifica- 
tion would still pose a threat to marine life if 
carbon-dioxide emissions were not slashed. 
Sulfur dioxide might delay ozone regenera- 
tion in the stratosphere. And whichever 
aerosol was used to filter out sunlight, more 
research would be needed on its impacts on 
health and the environment. 

The overall effects of solar geoengineering 
are uncertain. All studies so far are based on 
computer simulations, which are poor at 
forecasting regional climates, for example. 
The Earth system might hold surprises that 
digital models do not capture. The projections 
require thorough and sceptical examination. 

Furthermore, solar geoengineering raises 
difficult socio-political issues that cannot be 
wished away. It is uncertain how, or whether, 
the technique could be governed in ways that 
ensure prudence, accountability and justice. 
Who has the right to implement an inherently 
global technology? Would the technology 
weaken multilateral commitments to reduce 
emissions such as the Paris agreement? 

These issues matter deeply to developing 
nations. But most solar-geoengineering 
research is being done in the well-heeled 
universities of Europe and North America. 
Unless that changes, voices from the global 
north will set the policy agenda and decide 
which research projects should be accelerated 
or shut down. 

We are neutral on whether solar geo- 
engineering should ever be used. It has not 
yet been established whether it would bea 
beneficial addition to meeting the Paris goals. 
We recognize its potential physical risks and 
socio-political implications. And we oppose 
its deployment until research into its safety 
and effectiveness has been completed and 
international-governance mechanisms 
established. But we are committed to the 
co-production of research and to well- 
informed debate. 

Others have already taken sides. Some 
people in the global north have tried to 
convince their peers in the south that they 
should reject solar geoengineering. Cam- 
paigners who vehemently oppose it often 
make their case by emphasizing the risks 
and playing down the potential benefits’”. 
We take issue with this paternalism and 
propose an inclusive way forward. 


BIG DECISIONS 

Developing countries must be in a posi- 
tion to make up their own minds. Local 
scientists, in collaboration with others, 
need to conduct research that is sensitive 
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to regional concerns and conditions. For 
example, what effects might solar geo- 
engineering have on hurricanes in the 
Caribbean, flooding in Bangladesh or 
agriculture in East Africa? Broader discus- 
sions among academics, policymakers, the 
public and public intellectuals are needed 
on climate risks and justice. 

To begin this process, we (and the 
co-signatories of this Comment) have been 
running solar-geoengineering engagement 
workshops across the global south — the 
first of their kind — as part of the SRM 
Governance Initiative (SRMGI), in which 
SRM stands for solar radiation manage- 
ment. International and non-governmental, 
SRMGI was launched in 2010 by the Royal 

Society in London, 


“Solar The World Acad- 
geoengineering &™y of Sciences 
raises difficult | (TWAS) in Trieste, 
socio-political Italy, and the Envi- 
issues that ronmental Defense 
cannot be Fund in New York 


City. The regional 
workshops — held 
mostly in the past 
three years in Bangladesh, Brazil, China, 
Ethiopia, India, Jamaica, Kenya, Thailand, 
New Zealand (for the Pacific states), Pakistan 
and the Philippines — have brought together 
local climate scientists, journalists, policy- 
makers and representatives of civil society to 
learn about and discuss solar geoengineering. 

Participants had no consensus position 
on the technology. But they raised common 
hopes and concerns. In general, we found 
widespread opposition to deployment at this 
stage, but support for studies of local impacts. 
Asa participant at the Nairobi workshop put 
it: “This idea is crazy... but we have to under- 
stand it.” Many were sceptical about whether 
the methods would work and if developing 
countries, rather than more powerful gov- 
ernments, would have any say in how and 
whether solar geoengineering is deployed. 

To fund regional research, this week, 
SRMGI issues the first call for applications to 
a US$400,000 fund called Developing Coun- 
try Impacts Modelling Analysis for SRM 
(DECIMALS). The fund is administered by 
TWAS and financed by the Open Philan- 
thropy Project, a foundation backed by Cari 
Tuna and Dustin Moskovitz (co-founder of 
Facebook and the project-management app 
Asana). Developing-world scientists can 
apply to DECIMALS for funds to model the 
solar-geoengineering impacts that matter 
most to their regions. International collabo- 
rations will be supported and researchers 
will be asked to run local workshops to pro- 
mote wider discussion of the implications of 
their findings. 

Further outreach and research in the devel- 
oping world will require extra support from 
governments, universities and civil society 


wished away.” 


worldwide. Research funders in advanced 
economies should fund collaborations with 
scientists in developing countries. We would 
like to see an IPCC special report on the 
risks and benefits of solar geoengineering. 
Ultimately, a coordinated global research 
initiative — perhaps under an organiza- 
tion such as the World Climate Research 
Programme — is needed to promote 
collaborative science on this controversial 
issue. 

Solar geoengineering is fraught with risks 
and can never be an alternative to mitiga- 
tion. But it’s unclear whether the risks of 
solar geoengineering are greater than the 
risks of breaking the 1.5°C warming target. 
As things stand, politicians will face this 
dismal dilemma within a couple of decades. 
It is right, politically and morally, for the 
global south to have a central role in solar- 
geoengineering research, discussion and 
evaluation. = 
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Hans Rosling discusses population growth at the ReSource 2012 conference in Oxford, UK. 


STATISTICS 


Swansong of a data visionary 


Jim O’ Neill hails the last book by Hans Rosling, the statistician who recast progress. 


ans Rosling was many things. A 
Hoerscin and epidemiologist; a 

statistician and data visualizer; a 
staunch advocate of free data as the bed- 
rock of an accurate world view. Factfulness 
is Rosling’s last, and posthumous, book; he 
died in February 2017. Like his other work, 
including his famous presentations, it throws 
down a gauntlet to doom-and-gloomers in 
global health by challenging preconceptions 
and misconceptions. 

Co-written with Ola Rosling and Anna 
Rosling Rénnlund (Rosling’s son and 
daughter-in-law), Factfulness is a fabulous 
read, succinct and lively. It asks why so 
many people — including Nobel laureates 
and medical researchers — get the numbers 
so wrong on pressing issues such as poverty, 
pandemics and climate change. The book 
isolates the ten instincts that lead to what 
Rosling calls the “overdramatic worldview”: 
that pervasive, generally pessimistic global 
perspective that often cancels out significant 
progress made in the face of vast challenges. 

The “gap instinct”, for instance, is 
the tendency to divide everything into 


two — such as devel- 
oped and developing 
countries. This ‘us 
and them construct, 
he shows, is not borne 
out by the facts: some 
75% of people live in 
middle-income coun- 
tries. Rosling adds 
nuance to the global 
socio-economic pic- 
ture by outlining four 
income levels, from 
US$1 a day to more 
than $64. He offers a 
masterclass in avoiding 
the gap instinct by, for 
example, comparing 
averages — one num- 
ber can hide a range. 


Factfulness: Ten 
Reasons We’re 
Wrong About the 
World — and Why 
Things Are Better 
Than You Think 
HANS ROSLING 

WITH OLA ROSLING 
AND ANNA ROSLING 
RONNLUND 
Flatiron: 2018. 


Many of the other instincts that Rosling 
examines reveal his acuity about how easy 
it is for us to slide into negative, fearful or 
grandiose mindsets when faced with mono- 
lithic problems. His discussion of the “blame 
instinct”, for example, is important because it 
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reminds us of people's impatience with com- 
plex causation and tendency to pin everything 
on one agent. As he notes, problems such as 
lack of research into the diseases of the poor- 
est people are systematic. To change them, 
you have to understand the systems. 
Rosling enlivens the chapters with 
multiple-choice questions. These highlight 
our knee-jerk responses and the scale of 
progress in a number of areas. (He jokes that 
chimpanzees choosing at random would 
generally outperform humans on these.) For 
example, he asks: if there are 2 billion children 
in the world today, how many will there be by 
2100 — 4 billion, 3 billion or 2 billion? I got 
this wrong. Another question is how many of 
the world’s one-year-old children have been 
vaccinated against at least some form of dis- 
ease: 20%, 50% or 80%? This I got right, but 
few people do. These riveting exercises kept 
me up late one evening: I couldn't wait to see 
how stupid I was with the next question. 
Rosling also covers five risks that we 
“should worry about”: global pandemic, 
financial collapse, a new world war, climate 
change and the extreme poverty that still > 
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> afflicts 700 million people. Factful- 
ness is not, however, comprehensive. I 
have wondered whether Rosling would 
have been his usual cheery self when con- 
fronted with the challenges of antimicro- 
bial resistance (AMR), and the needed 
solutions. I had the chance to discuss this 
issue with him only briefly. Its absence 
from the book is disappointing, but then, 
I'm biased. I led a global review on AMR 
for the UK government under then-prime 
minister David Cameron, during which I 
departed from my own generally cheery 
take on the state of the world. 

I came to Rosling’s work in the past 
decade, when someone at Goldman Sachs 
Asset Management (which I chaired) sug- 
gested that I should follow one of Rosling’s 
online presentations about the state of the 
world, its evolution and probable future. 
It was wise advice. Although I initially 
sulked because 


oftheinference “J]gyed 

that Rosling Rosling’s 

discussed such positive 

information approach to 

‘saab meant life’s challenges; 
soon realizee it did, and does, 


that he took 
aspects of what 
I had become 
known for to a completely new level. His 
knowledge of issues around health, dis- 
ease control and many aspects of devel- 
opment was obviously deep and broad. A 
few years later, I shared a platform with 
him. I loved his positive approach to life’s 
challenges; it did, and does, inspire me. 

Long before US President Donald 
Trump and fake news, I warned audiences 
about slavishly believing what they read. 
As Rosling spells out, our awareness of 
facts has never been that strong, and many 
journalists are not motivated to tell us that 
the world is getting better. (No news is 
sometimes the only good news.) It is up 
to individuals to ensure that we foster the 
disciplined habits of mind that Rosling 
eloquently and clearly sets out. 

This magnificent book ends with a 
plea for a factual world view. Rosling was 
optimistic that this outlook will spread, 
because it is a useful navigational tool in 
a complex world, and a genuine antidote 
to negativity and hopelessness. A just 
tribute to this book and the man would 
be a global day of celebration for facts 
about our world. Perhaps Trump should 
lead the charge on that. m 


inspire me.” 


Jim O’Neill is an economist, a cross- 
bencher in the UK House of Lords, a 
distinguished visiting fellow at the think 
tank Chatham House in London, and 
honorary chair of economics at the 
University of Manchester, UK. 
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From meat to mind 


Douwe Draaisma enjoys Michael Gazzaniga’s 
exploration of the biological basis of consciousness. 


René Descartes visited the French 

philosopher at his estate on the Dutch 
coast. Aiming to discuss the cardinal points 
of Cartesian philosophy, Frans Burman had 
marked more than 70 passages in Descartes's 
works. How, he asked, can soul and body 
affect each other, given their fundamental 
difference? Descartes conceded that the 
question was thorny, but pointed to the 
evidence that they do — for instance, in 
emotion. The mystery lies in the mechanism, 
and this, Descartes confided, was perhaps 
best left to theologians. 

Neuroscientist Michael Gazzaniga tackles 
this abiding mind-body problem anew in 
The Consciousness Instinct. His subtitle, 
Unraveling the Mystery of How the Brain 
Makes the Mind, rephrases Descartes’s 
conundrum into a bold promise. But then, 
Gazzaniga is a bold scientist. He made his 
name in the 1960s through pioneering 
work on severing the connection between 
the brain's left and right hemispheres (‘split 
brains’), as his autobiography vividly details 
(D. Draaisma Nature 518, 298-299; 2015). 

His latest book is certainly evidence that 
scholars advancing in age (Gazzaniga is 
now 78) often trust themselves with ever 
broader scientific and philosophical ques- 
tions. Thus he guides readers through neu- 
rology, biology and psychology, discussing 
the origin and neural underpinnings of lan- 
guage or the mechanism of facial recognition. 


E April 1648, a young admirer of 
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fallacy” — the idea that 
one theory can cover 
everything, from our introspective sense of 
awareness down to the subatomic particles of 
brain tissue. Explanations, he asserts, should 
be thought of as context-dependent, just as 
light in quantum physics sometimes behaves 
like waves and sometimes like particles. 
Gazzaniga defines consciousness as “the 
subjective feeling of a number of instincts 
and/or memories playing out in time in an 
organism”. He points out that clinical cases — 
he spent a few years working on neurologi- 
cal wards — add complexities. For instance, 
people who are completely unable to move 
can still be conscious, a frightening condition 


Anerve cell in the human brain, seen in a false colour under a scanning electron microscope. 
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called locked-in syndrome. Consciousness 
might be absent in sleepwalking. Thus, 
coupling it to behaviour is misleading. 

Nor is it straightforward to link con- 
sciousness to parts of the brain. One of 
Gazzaniga’s earliest findings was that 
disconnecting the left and right hemi- 
spheres produced two separate conscious 
systems; only one, usually supported by 
the left brain, was able to express itself in 
language. It had been assumed that con- 
sciousness co-evolved with the cerebral 
cortex, supporting ‘higher functions 
such as language and reasoning. But refer- 
ring to the work of neuroscientist Bjérn 
Merker, Gazzaniga makes the case that 
consciousness might not be necessarily 
— or exclusively — locked into cortical 
and linguistic processes. In some children 
born with a seriously compromised fore- 
brain, the damaged tissue gets replaced by 
fluid (hydranencephaly). They grow up 
lacking language, but still express feelings 
and have subjective experiences. Accord- 
ing to Gazzaniga, consciousness might 
actually originate in the evolutionarily 
older midbrain, with the cortex providing 
“a collection of extensions (apps!) to 
enhance conscious experiences”. 

Inan engaging discussion of the brain's 
architecture, he offers a mundane simile 
for consciousness. The brain should be 
thought of as a multitude of modules, each 
specialized for a single task, such as rec- 
ognizing patterns or monitoring rhythm 
in music. The end products of these 
modules rise to the surface and burst like 
“bubbles in a boiling pot of water’, each a 
fleeting part of our awareness. Our sub- 
jective sense of continuity, described by 
pioneering psychologist William James 
as “stream of consciousness’, might be 
illusion: we merely experience the rapid 
succession of elements as a smooth 
movement, like the frames of a film. The 
metaphor of the bubbles seems first and 
foremost an invitation to generate a test- 
able theory, and Gazzaniga'’s observations 
will almost certainly provide much of the 
test material. 

Gazzaniga ends by reflecting that 
the ultimate explanation for how mind 
emerges from meat might not prove 
“warm and cuddly” Instead, it might vie 
with quantum mechanics for sheer coun- 
ter-intuitive weirdness, hovering “way 
beyond our intuitions and imaginations” 
Once again we seem to hear what Burman 
heard, 370 years ago: a sigh of resignation, 
as Descartes indicated that it might all be 
better left to the theologians. m 


Douwe Draaisma is professor of the 
history of psychology at the University of 
Groningen in the Netherlands. 

e-mail: d.draaisma@rug.nl 
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Tom Lehrer performing in San Francisco, California, in 1965. 


MATHEMATICS 


Tom Lehrer at 90 


Andrew Robinson looks back at the scientific high notes 
in the mathematician and satirist’s inimitable oeuvre. 


Tom Lehrer — who turns 90 this 

month — performed what he char- 
acteristically called a “completely point- 
less” scientific song at Harvard University 
in Cambridge, Massachusetts. (He was 
a PhD student there at the time.) ‘The 
Elements, now one of his most cherished 
works, sets the names of all the chemical 


I n 1959, the mathematician and satirist 


elements then known to the tune of the 
“Major-General’s Song’ from The Pirates of 
Penzance, the comic opera by W. S. Gilbert 
and Arthur Sullivan. Lehrer’s heroically 
precise, rapid-fire enunciation of 102 ele- 
ments (reordered to allow flawless end- 
rhymes), ends with the much-quoted 
crack, “These are the only ones of which 
the news has come to Harvard/And 
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> there may be many others but they 
haven't been discarvard” 

In the 1960s, Lehrer followed up with 
more than a dozen astringent, cynical 
and often pointedly political songs, such 
as ‘So Long, Mom, I’m Off to Drop the 
Bomb (A Song for World War III). As 
The New York Times had it, 
“Mr. Lehrer’s muse [is] not fet- 
tered by such inhibiting factors 
as taste.” (Lehrer reprinted the 
quote in his album liner notes.) 
In the fraught geopolitics and 
paranoia of the cold war, how- 
ever, Lehrer’s social criticism 
touched a chord with many in 
the United States. Fans might, 
however, have been surprised 
to learn that he had crunched 
numbers for the National Secu- 
rity Agency as an army draftee 
in the mid-1950s. 

Much of Lehrer’s oeuvre — 
some 50 songs (or 37, by his 
own ruthless reckoning) 
composed over nearly three 
decades — played with tensions 
at the nexus of science and 
society. His biggest hit, That 
Was The Year That Was, cov- 
ered a gamut of them. This 1965 
album gathered together songs 
Lehrer had written for That 
Was The Week That Was, the US 
satirical television show spawned by the 
BBC original. “Who's Next?’ exposes the 
dangers of nuclear proliferation. ‘Pollution’ 
highlights environmental crises building 
at the time, such as undrinkable water and 
unbreathable air. 

The rousing ballad “Wernher von Brau’ 
undermines the former Nazi — who 
designed the V-2 ballistic missile in the 
Second World War and later became a key 
engineer in the US Apollo space programme. 
In Lehrer’s view, it was acceptable for NASA 
to hire von Braun, but making him into 
an American hero was grotesque. “‘Once 
the rockets are up, who cares where they 
come down?’/“That’s not my department; 
says Wernher von Braun” — lines that still 
resonate in today’s big-tech ethical jungle. 
“New Math, meanwhile, skewers the educa- 
tion system through the lens of a misfired 
revolution in mathematics, with its telling 
refrain: “It’s so simple, so very simple, that 
only a child can do it” (A. Bellos Nature 516, 
34-35; 2014). 


LYRICAL PRECISION 

Lehrer — who grew up on New York 
City’s Upper East Side — certainly sees 
a connection between his mathemati- 
cal training, which began at Harvard at 
the prodigiously young age of 14, and his 
compositions. He was drawn to songwriting 
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The cover of Tom Lehrer’s debut album, released in 1953. 


in his teens; after failing to respond to classi- 
cal-music training, he switched to the study 
of popular music. In an interview in 2000, 
he summed up the fields’ dual impact. “The 
logical mind, the precision, is the same that’s 
involved in math as in lyrics,” he said. “It’s 
like a puzzle, to write a song” 


0h) ( tA Ce) - : 


Lehrer agrees with mathematician Stan- 
islaw Ulam (one of the builders of the 
atomic bomb) that rhyming “forces novel 
associations ... and becomes a sort of auto- 
matic mechanism of originality”. As he told 
me in 2008: “If ‘von Braur didn't happen to 
rhyme with ‘down (and a few other words), 
the most quoted 


“The logical couplet in the song 
mind, the . would not exist, and 
precision, 1s in all probability the 
the same in song itself would not 
math asin have been written.” 

lyrics.” His musical career 


began at university, 
with the spoof sports song ‘Fight Fiercely, 
Harvard’ In the early 1950s, Lehrer put on 
a satirical show in the physics department, 
The Physical Revue (a pun on the name of 
the US journal then named Physical Review). 
With co-performers including Norman 
Ramsey (later a Nobel laureate in physics) 
and Lewis Branscomb (who would become a 
presidential science-policy advisor), he per- 
formed ditties such as ‘Relativity; “Fugue for 
Scientists’ and “The Slide Rule Song: It was a 
training ground for later triumphs. 

He began recording in 1953. Although 
US radio stations refused to play such 
‘controversial’ material, his fame spread 
through word of mouth. In Britain, the 
royal approval of unexpected fan Princess 


Margaret and the support of the BBC signif- 
icantly raised Lehrer’s profile, and he con- 
sidered abandoning academia. But in 1960, 
bored by touring, he returned to Harvard, 
aiming to complete a long-standing math- 
ematics PhD on modes in statistics. Soon, 
however, he concluded he had nothing 
original to contribute academi- 
cally. As he notoriously wrote 
in ‘Lobachevsky’ a song named 
after a nineteenth-century 
Russian mathematician: “Pla- 
giarize!/Let no one else’s work 
evade your eyes!/...So don't 
shade your eyes,/but plagiarize, 
plagiarize, plagiarize/— only 
be sure always to call it, please, 
research.” Lehrer dropped his 
doctorate and began to teach 
mathematics — at the Massa- 
chusetts Institute of Technol- 
ogy in Cambridge in 1962 and, 
from 1972 until his retirement in 
2001, at the University of Cali- 
fornia, Santa Cruz (along with a 
class in musical theatre). 

He also largely gave up song- 
writing and public performing 
in the early 1970s. Following 
the award of the Nobel Peace 
Prize to then-US Secretary of 
State Henry Kissinger in 1973, 
Lehrer commented: “Political 
satire became obsolete.” And 
in 2002 he remarked, still less optimisti- 
cally: “Things I once thought were funny 
are scary now. I often feel like a resident 
of Pompeii who has been asked for some 
humorous comments on lava.” About the 
political earthquakes triggered by US Presi- 
dent Donald Trump, Lehrer has been silent. 

As for his songs, their vigour, concision, 
melodic variety and humour never stale. 
Although Lehrer is absurdly omitted from 
the Encyclopaedia Britannica (unlike his 
friend, the lyricist and composer Stephen 
Sondheim), his scathing creations remain 
one of the most original — not to mention 
mathematically elegant — bodies of 
artistic work to come out of the United 
States in the twentieth century. m 


Andrew Robinson is the author 

of Einstein: A Hundred Years of 
Relativity and The Last Man Who 
Knew Everything, a biography of 
Thomas Young. 

e-mail: andrew@andrew-robinson.org 


CORRECTION 

The exhibition review ‘Fake Views’ (Nature 
555, 442: 2018) erroneously described 
the artist Ryuta Nakajima as a woman; he 
isaman. 
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Hawking, Sulston 
and science in Europe 


As well as championing open 
and accessible research, physicist 
Stephen Hawking and biologist 
John Sulston were part ofa long 
tradition of British engagement 
with European science (for 
obituaries, see Nature 555, 444; 
2018 and Nature 555, 588; 2018). 
As chief scientific advisers to 

the European Commission, we 
feel strongly that this tradition 
must not end — irrespective 

of where the future takes the 
United Kingdom. 

Science for the greater public 
good depends on openness of 
mind, of spirit and of borders. 
Sulston and Hawking did much 
to uphold these ideals and to 
promote the importance of 
basing policy decisions on 
strong scientific evidence. 

Both recognized that society 
benefits from integrated 
scientific endeavour. Indeed, 
European projects built on 

this premise — such as the 
intergovernmental research 
organizations CERN (Europe's 
particle-physics laboratory near 
Geneva, Switzerland) and the 
European Molecular Biology 
Laboratory — have strengthened 
science by stimulating the 
movement of ideas across the 
continent and beyond. 

These great scientists shared a 
strong sense of social decency and 
encouraged a profound respect 
for expertise, each of which are 
more important now than ever. 
Rolf Heuer, Paul Nurse 
European Commission, Brussels, 
Belgium. 
rolf-heuer@cern.ch 


Nobel principles hold 
true after 123 years 


Nils Hansson and colleagues 
suggest that Nobel committees 
in 1901-66 were persuaded 

to award the Nobel Prize in 
Physiology or Medicine based on 
the potential research impact of 
a single discovery or innovation, 
rather than on a distinguished 


research record (Nature 555, 

311; 2018). As secretary general 
of the Royal Swedish Academy of 
Sciences, I can confirm that this 
is still the case. 

The Nobel prize is not a lifetime 
achievement award. In his last will 
of 1895, Alfred Nobel stipulated 
that the Physiology or Medicine 
prize should go to “the person 
who shall have made the most 
important discovery within the 
domain’; in physics should be for 
“the most important discovery or 
invention within the field”; and in 
chemistry should be awarded for 
“the most important chemical 
discovery or improvement”. 

It is reassuring that the 
assessment of hundreds 
of nominations by Nils Hansson 
(no relation of mine) and 
colleagues confirms that past 
committees have rigorously 
upheld Nobel's will. 

Goran K. Hansson The Royal 
Swedish Academy of Sciences, 
Stockholm, Sweden. 
goran.hansson@kva.se 


How philosophy was 
squeezed out of PhD 


Gundula Bosch’s argument for 
putting the philosophy back 

into the PhD is a breath of fresh 
air (Nature 554, 277; 2018). It is 
interesting to look back and see 
how broad critical thinking came 
to be eased out of the doctorate, 
squeezing academic enquiry into 
narrow disciplines. 

The process started in the 
early 1970s in the United States, 
prompted bya suspicion that 
intellectual artefacts of the ‘soft’ 
sciences, as they were then 
called — such as sociology, 
anthropology and philosophy — 
were stimulating campus unrest. 

This conveniently dovetailed 
with the idea that if industry 
outsourced its research and 
development departments 
to universities by setting (and 
funding) curricula, then students 
would have ready-made jobs in 
industry on graduation. These 
mechanistic conceits looked 
good on paper and fitted well 


with reductionists’ educational 
metrics. However, they all but 
killed students’ curiosity for 
serendipitous scientific enquiry. 

My father designed stellar- 
inertial guidance systems for 
reconnaissance aircraft and, after 
he retired, would often present his 
work to physics and engineering 
students. When they asked him 
what they should study to prepare 
for such a career, he would reply: 
“Read the classics,” by which he 
meant Aristotle, Ralph Waldo 
Emerson, Jean-Jacques Rousseau 
and Blaise Pascal. 

The best scientific and 
technical progress does not come 
out of a box. It is more likely to 
emerge from trying to fit wild, 
woolly and tangential ideas into 
useful societal and economic 
contexts. 

Michael Stocker Ocean 
Conservation Research, 
Lagunitas, California, USA. 
mstocker@ocr.org 


Sciences unite for 
Spain’s prosperity 


As Spain's economy recovers, the 
strategic application of science 
could help to stimulate prosperity 
and to attract much-needed 
investment. In an unusual move 
in a world of specialization, 
the Spanish scientific community 
has formed a meritocratic, 
all-sciences advisory council 
within the Gadea Foundation for 
Science in Madrid, a non-profit 
body of leading scientists that 
works to improve Spain's science 
system. The council’s aim is to 
galvanize politicians and the 
public into promoting research 
that will ensure social progress 
(see www.gadeaciencia.org). 
Spain ranks ninth in the world 
for scientific production and 
has 58 scientists in the 2017 
Clarivate Analytics Highly Cited 
Researchers list (see go.nature. 
com/2j77ctb). The application of 
research results for the benefit of 
society is still disturbingly low, 
however, owing to meagre public 
support and too few industries 
based on science and technology. 


The advisory council’ first 
forum was held in October 2017 
to develop a strategy for 
improving this situation. It was 
framed around four cornerstones: 
health, life sciences (including 
philosophy, mathematics and 
astrobiology), Earth (including 
materials and water, food and 
energy, and climate change 
and biodiversity) and society 
(including science policy and the 
economy). The forum's founding 
declaration emphasizes the 
importance for advancing society 
of knowledge, training, talent and 
academic-industrial interaction 
in all of these areas. Our view 
is that science is not just for 
scientists — it isa human right. 
Fernando Baquero, Jose 
A. Gutiérrez-Fuentes Gadea 
Foundation for Science, Madrid, 
Spain. 
ja.gutierrezfuentes@gadeaciencia. 
org 


Encouraging trend 
in US astronomy 


Aswin Sekhar remarks on 

the low proportion of female 
astronomers in many countries 
(Nature 555, 165; 2018). 

A career in science can often 
exceed 50 years, meaning that 
the total average number of 
women (and minorities) will 
remain low for another half 

a century, even if we achieve 
parity now among early-career 
scientists. What matters as much 
as where we've been is where 
were going. 

As the chair of the 
International Astronomical 
Union's US committee for 
membership applications, I can 
report that we are doing much 
better than the grand averages 
would suggest. Women comprise 
around 40% of the latest US 
intake of 212 individuals, with 
43% of those having gained 
their PhDs after 2010. This is 
an encouraging trend. 

David Soderblom Space 
Telescope Science Institute, 
Baltimore, Maryland, USA. 
drs@stsci.edu 
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OBITUARY 


Giinter Blobel 


(1936-2018) 


Biologist who decoded how proteins are sorted in cells. 


or much of the twentieth century, 
isc puzzled over how the pro- 

teins that build, run and leave cells get 
where they need to be. Over five decades, 
biologist Ginter Blobel hammered out the 
answer: the ‘signal hypothesis, a targeting 
system resembling a set of postal codes. It 
earned him the Nobel Prize in Physiology or 
Medicine in 1999. 

The steps to and beyond this discovery 
helped to explain normal cell organization, as 
well as many diseases in which the transport 
of proteins is defective. The signal hypothesis 
also laid the foundations of modern biotech- 
nology. Proteins such as human insulin and 
human growth hormone could be tagged for 
mass production in bacteria, secreted and 
easily collected. 

Blobel’s pathway to the prize began at the 
Rockefeller University in New York City, 
where he spent 50 years. As a postdoc in 
George Palade’s laboratory during the late 
1960s, he studied the secretory pathway 
taken by proteins set to be extruded from 
the cell. He and cell biologist David Sabatini 
found that newly minted proteins could not 
be digested as they emerged from the ribo- 
somes that synthesize them. The pair con- 
cluded that the proteins entered the secretory 
pathway as they were made. 

Blobel and Sabatini proposed that these 
proteins have a distinctive region — their 
postal code — that is recognized by special 
chaperones that guide the part-synthesized 
protein and its ribosome to a cellular orga- 
nelle called the endoplasmic reticulum. 
There, protein synthesis is continued as the 
protein crosses into the organelle and enters 
the secretory pathway. 

Many advances in our understanding of 
metabolism come from breaking cells into 
their components, and then reassembling 
the parts to work out which component does 
what. Blobel used this approach to test his 
hypothesis. He mixed ribosomes, the mes- 
senger RNA that encodes proteins, cytosol 
and membranes in vitro. His experiments 
failed repeatedly. 

But he never gave up. He continued mixing 
components from different organisms, con- 
vinced that, like the alchemists, ifhe had just 
the right ratios or sources of ingredients, he 
could create something new. Finally, in 1975, 
after four years in the cold room, he got a 
clear result: his nascent protein seemed to 
target and enter the secretory pathway. 

As with most research, there was not one 
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‘aha moment. Blobel fretted about ugly facts 
that could destroy his beautiful hypoth- 
esis. Perhaps the protein did not enter the 
secretory pathway, it was just stuck on the 
outside? Maybe the tests he was using to 
determine whether the protein entered the 
pathway were faulty? Two papers, published 
with cell biologist Bernhard Dobberstein in 
the Journal of Cell Biology in 1975, set out 
crucial experimental evidence amid a flurry 
of controls. 

The assays Blobel had developed using 
mainly mammalian components were 
quickly used to identify the signals that dis- 
patch proteins to other cell compartments — 
such as mitochondria, and to chloroplasts 
in plant cells — and to identify similar sys- 
tems in yeast and bacteria. The assays were 
applied to proteins to be secreted and those 
that would end up spanning membranes. 
The system was highly conserved: a mix- 
ture of human mRNA encoding a secretory 
protein, ribosomes from rabbit blood and 
membranes from yeast could synthesize the 
human protein. The impact of Blobel’s work 
cannot be overstated. 

Blobel’s career was filled with audacious 
speculations. “I always imagined how things 
would be,” he once said. For him, the thrill 
was the experimental chase. Yet he cau- 
tioned trainees to be ready to abandon their 
fantasies, “when data come which aren't 
compatible”. His speculations concerned the 
regulation of gene expression, the channels 
through which proteins crossed membranes 
and the pores that are the gateway into the 
cell’s nucleus. 


When necessary for his work, he learnt 
yeast genetics, modern structural biology 
and electrophysiology. Inured to the ridicule 
of his speculations, he once said, “There is 
an internal revolt in me against conforming. 
After the war, my family lived in East Ger- 
many and that taught me that truth is the 
most holy and important thing in life.” 

Blobel grew up in the Silesian village of 
Waltersdorf, where his father was a veterinary 
surgeon. As the Second World War was draw- 
ing to its end, when Blobel was 9, his family 
fled west, passing through Dresden. It was his 
first exposure to a big city. Asan adult, he often 
recalled being entranced by the many spires 
and the magnificent cupola of the Frauen- 
kirche. A few days later, he saw fire-bombing 
bring the church and city to ruin. 

Blobel studied medicine in Germany, 
but grew frustrated that it treated symp- 
toms rather than the causes of disease. 
He emigrated to the United States to doa 
PhD with Van R. Potter at the University of 
Wisconsin-Madison. There he succeeded in 
isolating the nucleus, which he investigated 
throughout his career with infectious enthu- 
siasm. In his last years in the lab, he proposed 
large changes in the structures of the pores 
that regulate the passage of material in and 
out of the nucleus. He dubbed these changes 
‘the ring cycle, in tribute to the Wagnerian 
operas he enjoyed on rare outings from the 
Rockefeller campus. 

Blobel was passionate about 
architecture — of cities as well as of cells. He 
donated his US$960,000 Nobel winnings to 
two extraordinary projects: the rebuilding of 
the bombed Frauenkirche, and of the Dres- 
den synagogue, destroyed on Kristallnacht. 

Debonair and jovial, Blobel appreciated 
repartee and a well-told tale. His wife, Laura 
Maioglio, was a frequent participant in Blobel 
lab functions and scientific affairs at Rock- 
efeller. And Blobel held court in her famed 
restaurant on Broadway, Barbetta, debat- 
ing literature, design, music and, of course, 
science. There, with his shock of white hair 
and emphatic gestures, he did recall the 
alchemists of old. Fittingly, his wizardry 
transformed our understanding of the cell. m 


Sanford Simon is professor of cellular 
biophysics at the Rockefeller University in 
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Figure 1 | Holding back the flood. In 2011, the muddy floodwaters of the Mississippi were prevented from inundating Arkansas agricultural land (left) by 
flood defences; green areas on the right are forested regions on the floodplain. Munoz et al.' present a palaeoflood record for the Mississippi River that spans the 
past 500 years, and conclude that levee building and channelization have made destructive floods larger and more common than before. 
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Mississippi rising 


The Mississippi River is shackled by one of the world’s largest systems of flood control. A palaeohydrological record 
suggests that those measures might actually be making floods worse. SEE LETTER P.95 


SCOTT ST. GEORGE 


he Mississippi River was once thought 
Te be uncontrollable: no feat of engi- 

neering could prevent the river from 
bursting its banks and sending floodwaters 
across its natural domain. But since the late 
nineteenth century, an expanding system of 
levees, floodways and channel modifications 
(Fig. 1) has gradually brought the river to 
heel, largely confining its waters to the main 
channel and accelerating them downstream. 
On page 95, Munoz et al.' conclude that those 
same control measures have inadvertently 
raised the threat of flooding in the lower Mis- 
sissippi to a level that is unprecedented in the 
past five centuries. 
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Some engineers and hydrologists have 
contended’’ that modifications to the 
Mississippi, and particularly the construction 
of the extensive levee system, have raised the 
height of the water and increased the volumet- 
ric flow rate (discharge) during major floods. 
The earliest available report of flooding on the 
Mississippi’ described an inundation in May 
1543, but the first permanent stream gauge 
was not established on the river until 1897, at 
Vicksburg, Mississippi. The discharge record 
from this gauge, and from several others 
in the river’s catchment area, now includes 
more than a century’s worth of observations. 
Nevertheless, because the most destructive of 
floods — suchas the great floods that occurred 
in 1927, 1993 and 2011 — are relatively rare, 


the perspective offered by direct hydrological 
measurements is still too narrow to determine, 
with certainty, whether the hazard posed by 
the river is changing. 

Munoz and colleagues have worked around 
that limitation by building their own extended 
record of flooding on the Mississippi, using 
evidence of high waters preserved by oxbow 
lakes and trees. An oxbow lake forms when 
a meander ofa river is cut off to form a free- 
standing body of water. When such lakes are 
inundated by floods, they act as natural sedi- 
ment traps for sand and silt carried in flood- 
waters. As those particles settle, they create a 
layer of coarse sediment on the bottom of the 
lake that is distinct from the clay and fine silt 
left behind when the lake is not hydrologically 
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connected to the main channel®. Prolonged 
inundation with floodwater can cause some 
species of tree — particularly oak — to form 
wood that has abnormal features. The annual 
growth rings of such trees in the ‘bottomland’ 
hardwood forest of the Mississippi floodplain 
therefore contain a natural record of past 
floods’. By splicing together sedimentary 
sequences from lakes in Louisiana and Missi- 
ssippi with ‘flood-ring’ signatures from living 
and dead trees in southeastern Missouri, the 
authors assembled a flood chronology for the 
lower Mississippi that stretches back to the 
early sixteenth century. 

Together, these natural archives have kept 
a remarkably faithful account of past floods. 
The tree rings mark the occurrence of the great 
floods of 1844 and 1927, as well as ’Année des 
Grandes Eaux (the Year of the Great Waters) 
in 1785, which destroyed French-Canadian 
settlements in Illinois and Missouri. It is more 
difficult to date individual floods in the lake 
records because of the lower chronological 
resolution of that archive, but layers of coarse 
sediment can be matched to the great floods 
of 1851, 1927 and 2011, as well as to the flood 
reported by Spanish conquistadors in 1543. 
Overall, this new palaeoflood record suggests 
that, although flood hazards have waxed and 
waned through time, the Mississippi has risen 
higher and flooded more frequently in the past 
century than during any other period in the 
past 500 years. 

The authors propose a provocative expla- 
nation for this recent hydrological intensifica- 
tion. Since the start of the twentieth century, 
records gathered using instruments show that 
the discharge of the Mississippi has slowly 
risen and fallen in concert with the surface 
temperature of the North Atlantic Ocean, 
which has alternated every two or three dec- 
ades between warm and cold states’. Munoz 
and colleagues draw on proxy temperature 
estimates for the North Atlantic® to demon- 
strate that this dependency has held steady 
since the 1500s. Because the spate of major 
floods in the past century cannot be explained 
by the observed temperature behaviour of the 
North Atlantic, the authors conclude that the 
trend towards larger and more frequent floods 
is mostly due to the transformation by humans 
of the Mississippi River and its basin. 

Blaming floods on the infrastructure that 
was built to guard against them will be contro- 
versial. The Mississippi basin has undergone 
upward trends in precipitation and evapo- 
transpiration — the sum of evaporation and 
plant transpiration from Earth's surface — in 
the past several decades’, so climatic factors 
other than the influence of the North Atlan- 
tic might also have affected the rhythm of the 
river. And, like the rest of North America, the 
Mississippi basin has warmed substantially 
since the end of the Little Ice Age”’ (a period 
of cooling that began in the sixteenth century 
and ended in the mid-nineteenth century”), so 


I think it is possible that the long-term trends 
in hydrology could be the result of climate 
change, rather than river engineering. Test- 
ing these competing explanations will require 
more palaeoflood work to be performed along 
the upper Mississippi and its main tributaries, 
where modifications to the river are less inten- 
sive and the climate still dominates the river’s 
hydrology”. 

In the meantime, Munoz and co-workers’ 
study makes it clear that a century or so of 
hydrological readings is not sufficient to take 
the measure of a river such as the Missis- 
sippi. Their palaeoflood record is compelling 
because it offers an opportunity to step back 
and consider the ebb and flow of the river 
on a timescale that befits its majesty. And if 
the authors are correct, and collective efforts 
to subdue the Mississippi have inadvertently 
pushed it to rise higher than ever, then the 
time might have come to consider loosening 
its restraints. m 
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Forests in flux as 
climate varies 


How do changes in climate affect forest ecosystems? A study of temperate 
forests in the United States has assessed alterations in biomass and tree-species 
composition across a 20-year period of climate variability. SEE LETTER P.99 


SEBASTIAAN LUYSSAERT 
& J. HANS C. CORNELISSEN 


ocumenting and understanding 
D changes induced by climate in 
the composition and function of 
vegetation is essential for planning adaptation 
strategies, because chances to intervene do not 
arise often for forests that contain trees with 
long lifespans. Moreover, most of the effects 
exerted by climate change on the composition 
and biomass of vegetation probably occur 
incrementally rather than abruptly’, which 
makes their detection a challenge. On page 99, 
Zhang et al.’ report an analysis of changes in 
forest biomass in the eastern United States over 
two decades, during a time when some regions 
became drier and others became wetter. 
Ecological studies have long focused on 
analyses in which the main groupings for 
organisms being studied are determined by 
evolutionary relationships such as belong- 
ing to the same genus. However, such kinship 
can hide functional differences. For example, 


even among closely related species of oak 
tree (Quercus spp.), some will thrive under 
moist conditions, whereas others are more 
suited to dry climates. In the past decade, the 
use of functional rather than kinship-driven 
approaches to grouping has provided many 
important insights’, and Zhang and colleagues’ 
work can be added to the list of studies that 
have successfully done this. 

Zhang et al. sought to investigate whether 
shifts in climate affect forest characteristics 
such as the prevalence of drought-tolerant 
species and the total biomass of the tree pop- 
ulation. The authors compared temperate- 
forest inventory data’ gathered during the 
1980s and the 2000s. This inventory includes 
surveys of around 100,000 plots, in which 
data such as species name and a standardized 
measurement of tree diameter were recorded 
for roughly 3 million trees. Diameter meas- 
urements allow the amount of biomass that 
is present above the surface of the ground to 
be estimated for a particular tree on the basis 
of previous studies of growth patterns for 
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Mesic 
tree 


Drought-tolerant 
tree 


1980s 


Forests reassessed 
20 years later, after 
a period of drought 


— 


Model a 
Mortality of some mesic trees 


Model b 
Drought-tolerant-tree growth is less 
affected than mesic-tree growth 


Model c 
Sapling established 


Figure 1 | Changes in the drought tolerance of tree populations. Zhang et al.’ used an inventory’ of 
trees growing in the eastern United States in the 1980s and the 2000s to estimate how an intervening 
period of drought had affected the contribution of drought-tolerant species to total biomass, in forests that 
had reached a given age. This analysis revealed an increase in the contribution of drought-tolerant trees 
compared with that of moisture-needing (mesic) trees to the population-level biomass. Three models 
could account for such an increase. Model a is consistent with the results for forests composed of trees 
more than 80 years old. Model b can account for the patterns observed by the authors in most other forests. 
The study’s time span is probably too short to fully capture the sapling emergence of model c. The drought 
tolerance of trees might partly depend on their entering into symbiotic interactions with root-colonizing 
fungi that extend the effective root length’. Such interactions could reduce tree biomass. If so, this might 


need to be considered in studies of this type. 


a given species. Such analyses enabled the 
authors to assess the aboveground biomass 
per hectare for tree populations, and also 
to investigate whether changes occurred in 
the relative contribution of particular types 
of species, such as drought-tolerant trees, 
to the total aboveground biomass. 

The authors analysed the data grouped into 
grids of cells that each covered an area of one 
degree of latitude by one degree of longitude 
(about 110 kilometres by 85 kilometres). The 
plots sampled in the 1980s and the 2000s were 
not identical; however, the authors were able 
to check their findings using a subset of plots 
that had been resampled and found that their 
conclusions remained the same. The authors 
determined the tree-age composition of the 
forests and assigned them into 20-year-inter- 
val age brackets. This enabled comparisons to 
be made between similar types of forest, for 
example, comparing 20-40-year-old forests in 
the 1980s and in the 2000s, therefore avoiding 
possible confounding factors such as changes 
in drought sensitivity that are linked to the 
increase in tree height as trees age”. 

The authors assigned a numerical score to 
each recorded species of tree that represented 
its drought-tolerant characteristics. This 
score was generated on the basis of informa- 
tion about water availability, derived from 
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measurements of the annual precipitation at 
grid cells where the species was found, and 
the minimum water requirements of that spe- 
cies. For each plot, the authors calculated the 
average drought tolerance of its tree popula- 
tion by using the species’ drought scores, also 
taking into account the relative abundance 
of each species. The authors then compared 
how drought tolerance at the tree-population 
level for a particular age class in a given grid 
cell changed in the roughly 20 years between 
inventories, and assessed whether these 
changes mirrored any changes in soil mois- 
ture (estimated by the Palmer drought severity 
index) at each location. 

Despite the occurrence of unavoidable 
problems such as logging during the data- 
collection period, Zhang and colleagues build 
a strong case that, between the 1980s and the 
2000s, the effects of ongoing climate varia- 
tion in this zone of modest regional climate 
change have resulted in detectable changes in 
forest composition and biomass. In areas in 
which soil moisture increased over time, the 
authors observed a population-level decrease 
in drought tolerance and a population-level 
increase in aboveground biomass per hectare. 

The most striking finding made by Zhang 
and colleagues was that, in areas in which 
soil moisture had decreased, a decrease in the 


average aboveground biomass gained per tree 
was accompanied by an increase in popula- 
tion-level drought tolerance. This increased 
share of drought-tolerant species occurred 
because the decrease in the growth rate of 
moisture-needing (mesic) species during a 
drought is greater than the decrease in growth 
rate of drought-tolerant species. Consequently, 
the forests that reached the particular age 
range in the 2000s have a lower biomass and 
a larger proportion of drought-tolerant spe- 
cies than the forests that reached this age range 
in the 1980s. 

In forests containing trees more than 
80 years old that experienced drought, the 
population-level changes in drought tolerance 
observed by the authors were often driven by 
an increase in the mortality of mesic species 
(modela in Fig. 1). The role of preferential tree 
mortality in driving forest changes that are 
linked to climate variability has already been 
reported in a study in Europe’. Yet, for most 
age ranges, the authors found that the tree 
population became more tolerant to drought 
because the growth of the drought-tolerant 
species was less affected than was that of the 
mesic species. This means that the drought- 
tolerant trees increased their proportional 
contribution to the aboveground biomass 
per hectare in the study’s two-decade period 
(model b in Fig. 1). 

Although the establishment of saplings 
probably contributes to the response of forests 
to climate variability (model c in Fig. 1), the 
short time span of the study, the fact that forest 
inventories commonly measure only individ- 
ual trees above a threshold size that it can take 
a decade or more for a sapling to reach, and 
the relatively small contribution of saplings to 
the total aboveground biomass for a given hec- 
tare of established forest, make it unlikely that 
Zhang and colleagues’ approach would capture 
fully the changes in species composition that 
are due to sapling emergence. 

The authors suggest that their results and 
observations could have relevance for how 
climate is affecting other temperate forests 
or forests in other climate zones. Yet perhaps 
the priority should be to unravel the mecha- 
nisms that underlie the connection between 
drought tolerance and biomass production at 
the species level. Zhang and colleagues sug- 
gest that the low availability of water should 
favour tree species that allocate a greater pro- 
portion of their biomass to fine roots, thereby 
promoting drought tolerance at the expense of 
aboveground biomass production. However, 
this might be only part of the climate-response 
phenomenon. 

We speculate that, compared with mesic 
species, drought-tolerant species might have 
greater investments in symbiotic relation- 
ships with soil-dwelling mycorrhizal fungi 
that can colonize tree roots. Such interactions 
can extend the effective total root length, 
thereby extending access to water and nutrients 
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in the soil’. However, such connections would 
probably come at a substantial cost in terms of 
tree-biomass reduction because of the need to 
divert sugars to fungal partners. The type and 
abundance of mycorrhizal symbioses vary with 
soil type and climate®”, so if fungal symbiosis 
is a major consideration in these scenarios, 
such factors would need to be considered in 
future implementations of the approach used 
by Zhang and colleagues. 

There is a pressing need to understand the 
relationship between water availability and the 
drought tolerance and biomass of forests. It is 
necessary, therefore, to ask whether the types 
of change that the authors observed would be 
able to keep pace with climate changes that 


CONDENSED-MATTER PHYSICS 


occur on longer timescales. For example, 
will the drought-tolerance capacity of today’s 
saplings suffice for the conditions that these 
trees might encounter when they reach matu- 
rity? It’s high time to knock on wood that it 
will, as well as to continue to investigate the 
mechanisms that affect forest ecosystems in a 
changing climate. m 
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Novel electronic states 
seen in graphene 


Asimple system made from two sheets of graphene has been converted from an 
insulator to a superconductor. The finding holds promise for opening up studies of 
an unconventional form of superconductivity. SEE ARTICLE P.43 & LETTER P.80 


EUGENE J. MELE 


the discovery of new electronic ground 

states in twisted bilayer graphene — a pair 
of single-atom-thick sheets of carbon atoms, 
stacked with their honeycomb lattices rotated 
out of alignment. The authors interpret one 
of these states” (page 80) as a correlated Mott 
insulator, a non-conducting state produced 
by strong repulsive interactions between 
electrons. The other’ (page 43) is a super- 
conductor, a state of zero electrical resistance 
produced by effective attractive interactions 
between electrons. The insulator turns into 
the superconductor when a small number 
of charge carriers are added to the graphene. 
This connection between the states is unlikely 
to be a coincidence — as Sherlock Holmes 
might have commented, “the universe is rarely 
so lazy”. 

Cao et al. show that the stacking of graphene 
sheets allows access to a new family of mater- 
ials with electronic behaviours that are 
exquisitely sensitive to the atomic alignment 
between the layers, which affects interlayer 
electron motion. This finding might sur- 
prise physicists, because electronic behaviour 
is usually dominated by whichever of the 
associated processes has the largest energy 
scale. But, in this case, there’s a conundrum: 
the energy associated with electron motion 
between atoms within a layer is of the order of 
electronvolts, whereas the energy for electron 


I: two papers in Nature, Cao et al.'* report 


motion between layers’ is, at most, hundreds 
of millielectronvolts. 

The resolution to this conundrum is a 
matter of symmetry. Well-prepared, single 
layers of graphene are highly ordered systems 
whose electronic properties are determined by 
a subtle symmetry, which is encoded in a solid- 
state version of the Dirac equation describing 
low-energy excitations. These excitations are 
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sensitive to interlayer couplings that alter the 
symmetries of the stack. 

Interactions between electrons in these 
excitations can produce forms of matter 
generically described as being strongly corre- 
lated. A well-reasoned strategy for discover- 
ing such forms of matter has been to restrict 
intralayer electron motion by applying a strong 
magnetic field*. This generates narrow elec- 
tron energy bands (Landau levels) in which 
electron-electron repulsion can control the 
physics of the graphene bilayer. 

Cao et al. have taken a simpler tack to 
discover strongly correlated states. They used 
the rotational misalignment of graphene sheets 
to tune twisted bilayer graphene into a regime 
in which interactions between electrons can 
dominate the electronic states of the system. 
Such rotational misalignment forces the elec- 
tronic band structures in the two sheets out 
of alignment and enlarges the bilayer’s unit 
cell (the smallest repeating unit of the crystal 
lattice) (Fig. 1a). For large rotations, the first 
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Figure 1 | The effects of rotation in twisted bilayer graphene. a, When a graphene bilayer is twisted so 
that the top sheet is rotated out of alignment with the lower sheet, the unit cell (the smallest repeating unit 
of the material’s 2D lattice) becomes enlarged. For large rotations, the electronic band structures of the 
two graphene sheets are also rotated out of alignment (not shown). b, For small rotation angles, a ‘moiré’ 
pattern is produced in which the local stacking arrangement varies periodically. Cao et al.” have observed 
that, for rotation angles of less than 1.05°, regions in which the atoms are directly above each other (the 
lighter regions in the pattern) form narrow electron energy bands, in which electron ‘correlation effects 
are enhanced. This results in the generation of a non-conducting state” (a Mott insulator), which can be 
converted into a superconducting state’ if charge carriers are added to the graphene system. 
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effect completely dominates, and electron 
motion between layers is suppressed by a 
kinematic barrier’. 

However, at very low rotation angles, a 
moiré pattern is produced by the misaligned 
lattices (Fig. 1b); the unit cell is greatly 
enlarged and so the effects of this come into 
play®”. The misalignment of the band structure 
essentially disappears, and theory predicts that 
the low-energy electronic states are completely 
reconstructed’. Coupling between electrons 
in the different layers becomes strong, and 
new narrow bands emerge at certain ‘magic’ 
rotation angles below 1.05° when the bilayer 
system is close to charge neutrality. Electrons 
in these narrow bands are found mainly in 
regions of the moiré pattern in which the 
atoms are stacked directly above each other 
(the light regions in Fig. 1b). In these cir- 
cumstances, the bilayer can be thought of as a 
synthetic, triangular lattice of weakly coupled 
quantum dots (tiny semiconductor particles 
that bind electronic states) with a residual 
tunnelling of electrons between them’®. 

Cao et al. fabricated twisted bilayer graphene 
so that the sheets are rotated at magic angles, 
and accumulated or depleted charge carriers in 
the system to study how the charge-transport 
properties of the system depend on the filling 
of the energy bands. The authors observed’ 
strong insulating behaviour when each unit 
cell of the synthetic lattice contained four 
charge carriers, a density that corresponds 
to complete filling of the bands. Intriguingly, 
they also find evidence for additional insu- 
lating states at lower densities in which the 
number of carriers per unit cell is an integer, 
but for which the narrow energy bands of the 
system are fractionally occupied. This suggests 
that the additional states are Mott insulating 
states, in which free motion of the carriers is 
prevented by their mutual repulsion, produc- 
ing gridlock on the lattice. Mott insulators are 
a strongly correlated, non-conducting form 
of matter. 

Even more intriguing is what happens when 
charge carriers are added to the Mott-insulator 
states associated with half-full unit cells of the 
synthetic lattice. The authors observe’ that 
the system enters a state that has zero electri- 
cal resistance below a critical temperature of 
approximately 1.7 kelvin, in a phase change 
known as a Berezinskii—Kosterlitz—Thouless 
transition, thus forming a 2D superconductor. 
This transition temperature is remarkably high, 
given the very low carrier density achieved 
in these measurements (10"' charge carriers 
per square centimetre). The high transition 
temperature and the apparent connection to 
correlated insulating states invites compari- 
son of this superconducting state to that of a 
family of ‘unconventional’ superconductors’, 
which also have a close relationship with other 
strongly correlated electronic ground states. 
Twisted bilayer graphene might therefore 
be a useful experimental system in which to 
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investigate the mechanism of unconventional 
superconductivity. 

In the meantime, Cao and colleagues’ 
discoveries will stimulate a wave of activity 
as scientists seek to unwind the microscopic 
basis for the reported striking phenomena. 
The findings also demonstrate the promise 
of using twisted bilayer graphene as a flexible 
and tunable platform in which correlated elec- 
tronic phenomena can be readily observed, 
and possibly even engineered and exploited”. = 
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Two-pore channels 


open up 


Two-pore channels span the membranes of acidic organelles inside cells. A 
structural and functional analysis reveals secrets about how these channels open 
to allow ions to pass across the membrane. SEE LETTER P.130 


SANDIP PATEL 


wo-pore channels (TPCs) are an 

ancient family of ion channels that are 

unusual because they are found, not at 
the cell surface, but spanning the membranes 
of acidic organelles such as endosomes and 
lysosomes. These organelles mediate biomol- 
ecule transport and breakdown, and serve as 
stores of calcium ions’ (Ca). TPCs are key 
for several organellar functions — releasing 
Ca” into the cytoplasm to control traffick- 
ing of material such as receptor proteins and 
viruses, for instance, and stabilizing junctions 
with other organelles’’. They are increas- 
ingly being associated with disorders such as 
Parkinsons disease, and are therefore emerg- 
ing as potential therapeutic targets’. Detailed 
structural information is scant, but advances in 
cryo-electron microscopy are revolutionizing 
our ability to study ion channels. On page 130, 
She et al.” use this technique to provide the first 
detailed view of an animal TPC. 

Previous work** has reported the atomic 
structure of a plant TPC. This consists of two 
subunits, each containing two similar trans- 
membrane domains (6-TMI and 6-TMII) 
connected by a large cytoplasmic linker. 6-TMI 
and 6-TMII are in turn each made up of six 
membrane-spanning regions, dubbed S1-S6. 
The pore through which ions flow is formed by 
S5 and S6 from each transmembrane domain 
in each subunit. 

She et al. resolved the structure of mouse 
TPC1. Their results revealed that the overall 


folding of this channel is, as expected, like that 
of plant TPC. Nonetheless, there is a surprising 
degree of structural conservation between the 
linkers, given that animal and plant TPCs 
have very different amino-acid sequences in 
this region. 

There are some structural differences, 
however. In plant TPC, the linker binds Ca** 
to help open the channel*”. But Ca™ binding by 
mouse TPC1 is unlikely, because amino acids 
essential for this interaction are missing. And 
the authors show that the carboxy-terminal 
domain of mouse TPC1, which is longer than 
the equivalent domain in plant TPC, forms 
a horseshoe-shaped arrangement of four 
helices that makes direct contact with the 
linker. This animal-specific feature probably 
serves to fine-tune channel activity. 

Activation of animal TPCs is complex and 
multifaceted. These channels were originally 
identified®’ as the targets for a messenger 
molecule called NAADP, which releases 
Ca” from acidic organelles*. Subsequent 
work revealed’ that TPCs are also activated 
by the lipid PI(3,5)P,. In addition, TPC1 is 
regulated by changes in voltage across the 
organelle membrane". She et al. demon- 
strated that both PI(3,5)P, and voltage changes 
are required to open TPC1; neither alone is 
sufficient (they did not examine NAADP). 
The authors then resolved structures of 
TPC1 in both the absence and presence of 
PI(3,5)P,, giving insight into the structural 
transitions that occur during channel opening. 
This analysis produced two key findings. 
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First, the group pinpointed the PI(3,5)P, 
binding site, which lies in 6-TMI (Fig. 1). Muta- 
tion of any one of several amino-acid residues 
in the network that forms this binding site can 
prevent TPC] activation by PI(3,5)P,. Interest- 
ingly, two of these residues — arginines ina 
short linker between S4 and S5 — are also 
required” for channel activation by NAADP. 
This suggests that PI(3,5)P, probably acts as 
a cofactor for NAADP action. Comparison of 
the free and PI(3,5)P,-bound forms of TPC1 
revealed that a single lysine residue in S6 
transmits conformational changes to the pore 
in response to PI(3,5)P, binding, thus directing 
the first stage of channel opening. 

Second, the authors found that changes 
in voltage are sensed by arginine residues in 
6-TMII (Fig. 1). Both 6-TMI and 6-TMII 
contain sequences in $1-S4 that are remini- 
scent of voltage sensors in other channels, 
but only 6-TMII has a specific helix in $4 that 
is required for voltage gating. The 6-TMII 
voltage sensor is in an upward, ‘activated’ form 
in both structures obtained by the authors — in 
this form, it can probably transmit changes to 
the pore, to which it is adjacent, completing 
opening of the channel. 

She and colleagues’ work on TPCs from 
animals, together with analyses*” of plant 
TPCs, indicate that both 6-TMI and 6-TMII 
cooperate to open the channel. 6-TMII is a 
target for voltage changes in both proteins. 
By contrast, 6-TMI is targeted directly by 
PI(3,5)P, in animal TPC1, and indirectly by 
Ca” in plant TPC. This is a prime example 
of how evolutionarily distant proteins have 
adapted to conserve a core function. 

Which ions pass through animal TPCs 


6-TMI 


once they open? Much research suggests that 
these channels are non-selective, like plant 
TPC, but some work indicates that they are 
selective for sodium ions!” (Na‘). She et al. 
found that TPC1 was about 70 times more 
permeable to Na* than to potassium ions (K’). 
Their structures reveal that the narrowest 
part of the pore through which ions are fil- 
tered is shaped like an oblong ‘coin slot’, con- 
stricted by specific asparagine residues. The 
authors provide evidence that these residues 
allow the small Na* ions through, but not the 
larger K" ions. This sieve effect is unlikely to 
explain the authors’ data indicating that TPC1 
apparently selects for Na* over Ca”*, because 
these ions are about the same size. However, 
the electrophysiological experiments used 
by the researchers to determine ion selec- 
tivity were performed under very different 
conditions from those in live cells, where 
the permeability of TPCs to Ca” is readily 
demonstrable”. 

In sum, She and colleagues’ structures 
provide major insight into how TPCs work. 
They join recently reported structures'*""° 
for a related family of ion channels, the TRP 
mucolipins (TRPMLs). Like TPCs, TRPMLs 
reside in acidic organelles, are activated by 
PI(3,5)P, and release Ca™* to control cellular 
functions such as gene transcription”. The 
PI(3,5)P. binding site in TRPMLs is probably 
in the protein’s amino-terminal region’®”” 
and is thus very different from that in TPCs, 
although it has yet to be directly observed. 

These rapid advances in the structural 
biology of organellar ion channels will aid 
future attempts to rationally design drugs that 
modulate ion flux through the channels. This is 


6-TMIl 
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Figure 1 | Structure of mouse TPC1. She et al.’ have resolved the structure of the channel protein 
TPC1, which is found at organelle membranes. TPC1 has two subunits, each of which contains two 


transmembrane domains (6-TMI and 6-TMII), connected by a linker. Square brackets indicate the top of 
the helices that make up 6-TMI and 6-TMII. Here, only one subunit is depicted. The authors found that 
channel activation requires the lipid PI(3,5)P, (blue), which binds to arginine amino-acid residues (red) 
in 6-TML, and voltage sensing through arginine residues (purple) in 6-TMIL. Activation results in the flow 
of ions through the central pore region into the cytoplasm. 
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50 Years Ago 


The University of Loughborough 

is already receiving encouraging 
response to its announcement last 
week ofa new type of vacation course. 
Three weeks of courses are being 
arranged in July for technologists, 
scientists, managers, teachers and — 
here lies the novelty — for their 
spouses. Provision is being made 

for children so that families can be 
together while parents catch up on 
some mid-career training ... Twenty- 
four technical courses during the 
three weeks will cover such subjects 
as optics, ultrasonics ... statistics and 
management ... Cultural courses 

for spouses ... will cover industrial 
archaeology, music, drama and new 
techniques of food production ... 
Accommodation is provided on the 
campus for families at reasonable 
rates ... While hoping to provide all 
the facilities of a holiday camp, the 
university believes that its vacation 
courses will be more valuable than 
the description “intellectual Butlins” 
implies. 

From Nature 6 April 1968 


100 Years Ago 


The recent development of aviation 
has provided a means of observing 
clouds which is much superior to 
any hitherto known. A modern 
aeroplane can reach the clouds ina 
very short time, and in many cases 
get above them. Observations of 
temperature can easily be obtained, 
and probably humidity observations 
would present no great difficulties. 
The “bumps” experienced also 

give some information as to the 
nature of the disturbance causing 
the formation of the clouds. It 

is well known that the two most 
important processes which cause 
clouds to form are (1) the mixture 
oflayers of air of high humidity and 
different potential temperature, 

(2) adiabatic expansion due to 
upward movement. 

From Nature 4 April 1918 
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pertinent as the number of diseases found to be 
associated with channel abnormalities grows. 
Mutations in TRPMLI cause a lysosomal 
storage disorder affecting children, and TPCs 
have been implicated in fatty liver disease, 
Ebola infection and several neurodegenera- 
tive disorders!”"*. In this context, a human 
TPC structure would be most welcome. 
Another challenge is to resolve the structure 
of TPC2. This protein is regulated by NAADP 
and PI(3,5)P., but not by changes in voltage — 
begging the question of how conformational 
changes in one TM domain are transmitted 
to the other to allow channel opening. No 


MICROBIOLOGY 


doubt, TPCs will reveal further secrets through 
forthcoming structures. m 
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Bacterial persister 
cells tackled 


Chronic infections can be hard to treat because slow-growing bacteria known 
as persister cells are usually unharmed by antibiotics. The identification of 
molecules that target such cells might provide a solution. SEE LETTER P.103 


JULIAN G. HURDLE & ADITI DESHPANDE 


he use of antibiotics to treat an infection 

can be unsuccessful when bacteria 

evade such drugs through genetic 
changes that endow them with antibiotic 
resistance. Pathogenic bacteria can also avoid 
antibiotic-mediated destruction through 
another route: some bacterial cells enter a met- 
abolically inactive or dormant state to become 
persister cells, which grow slowly or not at all. 
Most antibiotics were discovered in experi- 
ments that tested the ability of compounds to 
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Chronic infection 
with persister bacteria 


inhibit bacterial growth, and they are therefore 
often ineffective for treating non-growing 
persister cells’. On page 103, Kim et al.” now 
report the identification of small molecules 
that can kill persister cells. 

Persister cells are the source of many of 
the recurrent bacterial infections that affect 
people, for example those associated with 
implanted medical devices, such as the heart 
infection endocarditis, and also lung infections 
that can arise in cystic fibrosis’. Curing such 
chronic infections can require surgery, which 
places an added health burden on patients. 


Membrane distortion 
causes bacterial cell death 


Figure 1 | A retinoid compound destroys bacterial persister cells. a, Bacterial populations commonly 
consist of rapidly dividing cells and slow-growing cells that are known as persister cells. When bacteria 
are treated with conventional antibiotics, the rapidly dividing cells are destroyed. However, the persister 
cells can remain, giving rise to chronic infection. b, Kim et al? report a retinoid compound called 
analogue 2 that was optimized for targeting persister bacteria. In a mouse model of bacterial infection, 
the authors found that analogue 2 could kill persister cells. Electron microscopy and computer modelling 
revealed that analogue 2 probably binds to phospholipid molecules in the bacterial membrane, resulting 
in membrane distortion that might help to kill the bacteria. 
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And any necessary extended periods of 
treatment with antibiotics will increase the 
probability that bacteria evolve resistance. 
The development of treatments for killing 
persister cells is therefore urgently needed’, 
especially to target persisters that arise in infec- 
tions with strains of the ‘superbug’ bacterium 
methicillin-resistant Staphylococcus aureus 
(MRSA), which is resistant to several common 
antibiotics. Infection with MRSA is associated 
with illness and death, particularly among 
people with invasive infections’. 

Kim and colleagues decided to search for 
molecules that could offer protection from 
MRSA infection, using the roundworm Caeno- 
rhabditis elegans as a model system. Taking 
a high-throughput approach, the authors 
tested the ability of around 82,000 small 
synthetic molecules to protect worms from 
death mediated by MRSA infection. Of the 
185 compounds that conferred protection, 
the authors focused on two molecules called 
CD437 and CD1530, both of which kill MRSA 
cells rapidly and can also target Enterococcus 
faecium, a bacterium that is linked to endo- 
carditis. Unfortunately, these compounds had 
no effect against Gram-negative bacteria — 
a group that includes Escherichia coli — for 
which new therapeutic options are desperately 
needed because they, too, can form antibiotic- 
resistant superbugs. 

CD437 and CD1530 belong to a class of 
molecule known as the retinoids, which are 
structurally similar to vitamin A. Since the 
1960s, retinoids have been developed to treat 
various conditions, including acne®. Subse- 
quent synthetic modification of the retinoids 
has therefore generated derivatives that are 
often present in chemical libraries used in drug 
discovery. 

The authors concluded that prompt kill- 
ing of MRSA cells occurred when the two 
retinoid molecules distorted the structure of 
the bacterial membrane’s lipid bilayer. Kim 
and colleagues then carried out electron- 
microscopy studies, which revealed that the 
retinoid treatment caused curvature and fold- 
ing of the bacterial membrane but did not 
result in membrane destruction. 

The bacterial membrane is a permeability 
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barrier that is essential for many cellular 
functions’, and it contains proteins that are 
crucial to controlling the uptake of nutrients, 
the release of waste and the production of 
energy in the form of molecules of ATP. By 
distorting the membrane, the retinoid mol- 
ecules probably affect the import and export 
of solutes, as well as several other essential 
cellular functions that rely on membrane 
integrity. However, a mode of antibacte- 
rial action that involves simply attacking the 
bacterial membrane is not guaranteed to kill 
persister cells. For example, the authors report 
that the membrane-targeting antibiotics 
nigericin and valinomycin do not kill MRSA 
persister cells. 

Through computer simulations, the authors 
explored how the retinoid molecules might 
interact with the bacterial membrane. They 
determined that the polar side groups of 
CD437 and CD1530 could bind to the hydro- 
philic heads of phospholipids in the mem- 
brane, enabling the retinoid molecules to 
lodge in the lipid bilayer of a bacterium. Such 
simulations are a powerful tool that could be 
used to guide the optimization of antibiotics 
that can selectively attack the lipid bilayer of 
bacterial membranes without disrupting their 
mammalian counterparts and causing toxicity 
to patients. 

A major concern is how to optimize small 
molecules such as the retinoids to enable such 
selectivity. Although the authors found that 
CD437 and CD1530 did not destroy the lipid 
membranes of human red blood cells, the 
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molecules were able to kill human liver-cancer 
cells grown in vitro, which is consistent with 
the previously reported anticancer properties 
of the retinoids’. 

The authors generated structural variants 
of CD437, producing a compound they called 
analogue 2 that did not kill normal or cancer- 
ous human liver cells grown in vitro, but did 
retain the ability to kill MRSA persister cells 
(Fig. 1). In experiments in mice, analogue 2 
remained in circulation in the animals’ bodies 

for several hours at 


“Some bacterial high enough concen- 
cells enter a trations to kill MRSA 
metabolic ally persister cells, but did 
° ° not give rise to signs 
a of toxicity such as 
ta heceiic liver or kidney dam- 


age. Remarkably, the 
authors showed in 
mice that analogue 2 
could tackle what would generally be consid- 
ered to bea treatment-resistant form of MRSA. 
This animal model mimics chronic infection 
with MRSA in immunocompromised people, 
for whom the prognosis is poor with conven- 
tional antibiotic treatments such as vanco- 
mycin because of the large number of MRSA 
persister cells that are present*”. 

The authors found that the effects of 
analogue 2 on bacterial infections could be 
boosted by the presence of the antibiotic 
gentamicin, an inhibitor of bacterial protein 
synthesis that lacks activity against persister 
cells. It will be interesting to determine whether 


persister cells.” 


Observations of the 
birth of crystals 


Different forms of molecular crystals often have distinct properties, which can 
greatly influence their potential applications. A way of controlling the crystal 
form of a protein has now been reported. SEE LETTER P.89 


ROBERT G. ALBERSTEIN & F. AKIF TEZCAN 


r | he second law of thermodynamics 
dictates that all things tend towards 
disorder. Yet molecules and other 

microscopic particles in liquids frequently 

arrange themselves into perfectly ordered 
arrays — crystals — without violating this 
law. Moreover, a given molecule can often 
arrange itself into more than one type of array, 
producing different crystal forms known as 
polymorphs. These polymorphs can have 
remarkably different properties despite 
being composed of the same building blocks. 


On page 89, Van Driessche et al.’ report 
experimental observations of protein mol- 
ecules as they begin to assemble into clusters 
that then evolve into distinct polymorphs. 
Their findings bring fresh insight to the impor- 
tant processes of crystal formation and growth, 
and polymorph selection. 

The everyday consequences of crystal 
polymorphism are perhaps highlighted best 
by pharmaceutical drugs, most of which are 
administered as crystalline solids’. Poly- 
morphs of drug molecules often exhibit 
considerable variation in their ease of manu- 
facture, their shelf life and — crucially — their 
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MRSA persister cells respond to the retinoids 
by reactivating their cellular metabolism, 

thereby making them more susceptible to 
being killed by drugs such as gentamicin that 
would otherwise be ineffective. 

Molecules such as analogue 2 might be 
suitable candidates for drugs that decrease 
the time required to successfully treat chronic 
infections that are characterized by high loads 
of dormant bacteria. In an era in which the 
development of antibiotics is struggling to 
keep pace with the spread of resistant bacteria, 
the identification of compounds such as ana- 
logue 2 could help researchers to win victories 
in the long fight against bacterial infectious 
diseases. m 
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physical and chemical properties, which 
greatly influence their physiological efficacies”. 
The selection and manufacture of appropriate 
polymorphs is a major and costly component 
of the drug-development process, yet processes 
for polymorph selection are largely conducted 
on the basis of trial and error, rather than 
through molecular design. 

The development of rational approaches 
for the design and control of crystal growth, 
as well as for polymorph selection, requires 
an understanding of nucleation — the initial 
stages of crystallization, in which the building 
blocks begin to form clusters known as nuclei. 
Unfortunately, there are two main hurdles to 
capturing and characterizing crystals at birth. 
First, the nuclei are typically too small to be 
visualized in 3D space using most experimen- 
tal methods, especially when they consist of 
atoms or small molecules. Second, such nuclei 
are, by definition, unstable and therefore form 
only transiently. 

To address the first issue, Van Driessche and 
colleagues used the protein glucose isomerase 
(GI) asa building block, the box-like shape and 
nanometre dimensions of which make it rela- 
tively easy to identify using a technique called 
cryo-transmission electron microscopy (cryo- 
TEM). And to overcome the second issue, they 
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Figure 1 | Nucleation determines crystal growth and structure. Van Driessche et al.' have worked 

out how interactions between molecules of a box-shaped protein affect its nucleation (the formation of 
molecular clusters known as nuclei that act as ‘seeds’ for crystals) and, in turn, the growth and structure 
of crystals of the protein. Coloured semicircles represent sites of interaction on the molecules. a, Under 
conditions in which interactions between blue sites are stronger than those between red sites, rod-shaped 
nuclei form. Subsequent crystal growth occurs through the addition of either individual proteins or rods, 
and results in the formation of prismatic crystals. b, When the blue interactions are as strong as the red 
interactions, the nuclei have smaller aspect ratios than in a and grow equally in all directions through 

the addition of individual proteins. The resulting crystals are rhombic. c, Strong but indiscriminate 
interactions cause the molecules to form disordered clusters, which produce similarly disordered gels. 


(Adapted from a figure supplied by R. G. Alberstein.) 


used a protocol in which protein samples were 
rapidly frozen to about —183 °C, a temperature 
at which essentially all motion by GI molecules 
stops. This enabled them to take snapshots of 
the crystallization process at time intervals 
ranging from seconds to minutes. The authors 
thus captured the nucleation and growth of 
GI crystals with sufficiently high temporal 
and spatial resolution for them work out how 
the emergent crystal morphologies depend 
on specific interactions formed between GI 
molecules. 

To initiate crystallization, the authors mixed 
GI with ammonium sulfate or polyethylene 
glycol (PEG), which are commonly used as 
agents for modulating protein solubility and 
the strength of interactions between proteins. 
They observed that, at high concentrations of 
ammonium sulfate, GI molecules rapidly line 
up into rods that are a few proteins in length. 
These rods then align side by side, while also 
growing longer, to yield macroscopic crys- 
tals with a rectangular, prism-like shape that 
resembles the rod-shaped nuclei (Fig. la). By 
contrast, low concentrations of PEG cause the 
GI nuclei to grow more slowly and evenly in all 
dimensions, to yield rhombic crystals with a 
diamond-like shape (Fig. 1b). And at high con- 
centrations of PEG, the GI molecules become 
locked into structures that are best described 
as disordered gels rather than crystals (Fig. 1c). 
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Van Driessche and colleagues’ cryo-TEM 
images of the GI nuclei are detailed enough to 
be compared with known 3D atomic structures 
of GI crystals. Such comparisons enabled the 
authors to propose plausible models for the 
arrangement of GI molecules in the nuclei, as 
well as for the specific interactions between the 
molecules. The authors then designed mutants 
of GI in which a key amino-acid residue at 
each interaction site in the various nuclei was 
replaced with another residue. The mutant 
proteins were unable to form their corre- 
sponding polymorph and either produced the 
alternative crystal polymorph or aggregated 
into gels, depending on the conditions. These 
observations validated the proposed structural 
models and demonstrated that polymorphs 
could be selected predictably. 

Classical nucleation theory (CNT) posits 
that crystallization must start with the forma- 
tion of a nucleus that has the same molecular 
order and arrangement as do the macroscopic 
crystals, and that the building blocks are added 
one by one to the nucleus’. In the past two to 
three decades, CNT has been largely super- 
seded by two-step or multistep nucleation 
models in which an amorphous, high-density 
liquid phase forms, and then transitions into 
either a single crystalline domain (as occurs 
in some inorganic nucleation processes*) or 
numerous small, locally ordered clusters, 


which align to form growing crystals by a 
process known as oriented attachment. In light 
of these competing views, characterization of 
the nucleation events for various systems has 
been the subject of much experimental and 
theoretical work’, 

Van Driessche and co-workers’ observa- 
tions, particularly of the prismatic GI crystals, 
indicate that elements of both CNT and 
multistep nucleation might be at play. The 
authors uncover no evidence of an amorphous 
liquid phase, and the smallest GI rods, which 
they captured at very early time points (just 
20 seconds after the addition of ammonium 
sulfate), have the same crystalline registry 
as the mature crystals — findings that are in 
accord with CNT. However, the rods undergo 
oriented attachment during crystal matura- 
tion, as in the multistep model. The picture 
is less clear for the rhombic GI crystals, the 
precursors of which become observable only 
after several minutes — at which point they 
are already bigger than a nucleus. It could be 
that an amorphous, high-density liquid phase 
does form in this case but escapes detection 
because of its instability or the low contrast of 
the cryo-TEM images. 

We can, however, be certain that rapidly 
advancing techniques such as cryo-TEM, 
liquid-cell transmission electron micros- 
copy’* and in situ atomic force microscopy’, 
complemented by theory and computational 
modelling’, will continue to provide intriguing 
results with which to refine our understand- 
ing of crystal nucleation and growth. More 
practically, Van Driessche and colleagues’ 
study shows that the crystallization or 
self-assembly pathways of proteins can be 
rationally engineered at the molecular level 
to obtain a desired polymorph. This feat is 
particularly notable, given that the number of 
protein-based agents being used as pharma- 
ceutical drugs is increasing’, and that synthetic 
protein assemblies and crystals are being 
designed and constructed to have unusual and 
potentially useful properties”. = 
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Unconventional superconductivity in 
magic-angle graphene superlattices 


Yuan Caol, Valla Fatemi!, Shiang Fang’, Kenji Watanabe’, Takashi Taniguchi*, Efthimios Kaxiras** & Pablo Jarillo-Herrero! 


The behaviour of strongly correlated materials, and in particular unconventional superconductors, has been studied 
extensively for decades, but is still not well understood. This lack of theoretical understanding has motivated the 
development of experimental techniques for studying such behaviour, such as using ultracold atom lattices to simulate 
quantum materials. Here we report the realization of intrinsic unconventional superconductivity—which cannot be 
explained by weak electron-phonon interactions—in a two-dimensional superlattice created by stacking two sheets of 
graphene that are twisted relative to each other by a small angle. For twist angles of about 1.1°—the first ‘magic’ angle— 
the electronic band structure of this ‘twisted bilayer graphene’ exhibits flat bands near zero Fermi energy, resulting in 
correlated insulating states at half-filling. Upon electrostatic doping of the material away from these correlated insulating 
states, we observe tunable zero-resistance states with a critical temperature of up to 1.7 kelvin. The temperature-carrier- 
density phase diagram of twisted bilayer graphene is similar to that of copper oxides (or cuprates), and includes dome- 
shaped regions that correspond to superconductivity. Moreover, quantum oscillations in the longitudinal resistance 
of the material indicate the presence of small Fermi surfaces near the correlated insulating states, in analogy with 
underdoped cuprates. The relatively high superconducting critical temperature of twisted bilayer graphene, given such 
a small Fermi surface (which corresponds to a carrier density of about 10" per square centimetre), puts it among the 
superconductors with the strongest pairing strength between electrons. Twisted bilayer graphene is a precisely tunable, 
purely carbon-based, two-dimensional superconductor. It is therefore an ideal material for investigations of strongly 
correlated phenomena, which could lead to insights into the physics of high-critical-temperature superconductors and 


quantum spin liquids. 


Strong interactions among particles lead to fascinating states of matter, 
such as quark-gluon plasmas, various forms of nuclear matter within 
neutron stars, strange metals and fractional quantum Hall states!~?, 
An intriguing class of strongly correlated materials is the unconven- 
tional superconductors, which includes materials with a range of 
superconducting critical temperatures T., from heavy-fermion and 
organic superconductors with relatively low T- (a few to a few tens 
of kelvin) to iron pnictides and cuprates that can have T. > 100K 
(refs 4-8). Despite extensive experimental efforts to characterize these 
materials, unconventional superconductors are challenging to study 
theoretically because the models that are typically used to describe 
them cannot be solved exactly, motivating the development of alter- 
native approaches for investigating and modelling strongly correlated 
systems. One approach is to simulate quantum materials with ultra- 
cold atoms trapped in optical lattices, although technical advances are 
necessary to realize d-wave superfluidity with ultracold atoms at lower 
temperatures than are currently possible”"®. 

Here we report the observation of unconventional superconductivity 
in a two-dimensional superlattice made from graphene—specifically, 
‘magic angle’ twisted bilayer graphene (TBG). Created by the moiré 
pattern between the two graphene sheets, the magic-angle TBG super- 
lattice has a periodicity of about 13 nm, between that of crystalline 
superconductors (a few angstrém) and optical lattices (about a micro- 
metre). One of the key advantages of this system is the in situ electrical 
tunability of the charge carrier density in a flat band with a bandwidth 
of the order of 10 meV. This tunability enables us to study the phase 
diagram of unconventional superconductivity in unprecedented resolu- 
tion, without relying on multiple devices that are possibly hampered by 


different disorder realizations. The superconductivity that we observe 
has several features similar to that of cuprates, including dome struc- 
tures in the phase diagram and quantum oscillations that point to small 
Fermi surfaces near a correlated insulator state. Furthermore, it occurs 
for record-low carrier densities of the order of 10'!cm~’, orders of 
magnitude lower than the carrier densities of typical two-dimensional 
superconductors. The relatively high T-= 1.7 K for such small densities 
puts magic-angle TBG among the superconductors with the strongest 
coupling, in the same league as cuprates and the recently identified 
FeSe thin layers'!. Our results establish magic-angle TBG as a purely 
carbon-based two-dimensional superconductor and, more importantly, 
asa relatively simple and highly tunable material that enables thorough 
investigation of strongly correlated physics. 

Monolayer graphene has a linear energy dispersion at its charge neu- 
trality point. When two aligned graphene sheets are stacked, the 
hybridization of their bands due to interlayer hopping results in fun- 
damental modifications to the low-energy band structure depending 
on the stacking order (AA or AB). If an additional twist angle is present 
between layers, a hexagonal moiré pattern consisting of alternating 
AA- and AB-stacked regions emerges and acts as a superlattice modu- 
lation!?-!°, The superlattice potential folds the band structure into the 
mini Brillouin zone. Hybridization between adjacent Dirac cones in 
the mini Brillouin zone has an effect on the Fermi velocity at the charge 
neutrality point, reducing it from the typical value!*-!® of 10°m s~'. 
At low twist angles, each electronic band in the mini Brillouin zone has 
a four-fold degeneracy of spins and valleys, the latter inherited from 
the original electronic structure of graphene!*!”!°, For convenience, 
we define the superlattice density n,=4/A to be the density that 
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corresponds to full-filling of each set of degenerate superlattice bands, 
where A = 4/3 a7/(207) is the area of the moiré unit cell, a=0.246nm 
is the lattice constant of the underlying graphene lattice and 6 is the 
twist angle. In Supplementary Video, we present an animation of the 
way in which the band structure in the mini Brillouin zone of TBG 
evolves as the twist angle varies from 6 = 3° to 9=0.8°, calculated using 
a continuum model for one valley”. 

Special angles, namely the ‘magic angles, exist, at which the Fermi 
velocity drops to zero; the first magic angle is predicted’? to be 
9) = 1.1°. Near this twist angle, the energy bands near charge neu- 
trality, which are separated from other bands by single-particle gaps, 
become remarkably flat. The typical energy scale for the entire band- 
width is about 5-10 meV (Fig. 1c)!*’®. Experimentally confirmed con- 
sequences of the flatness of these bands are high effective mass in the 
flat bands (as observed in quantum oscillations) and correlated insu- 
lating states at half-filling of these bands, corresponding to n= -n,/2, 
where n = CV,/e is the carrier density defined by the gate voltage V, (C 
is the gate capacitance per unit area and ¢ is the electron charge)’®. 
These insulating states are a result of the competition between Coulomb 


Resistance, R,, (kO) 


energy and quantum kinetic energy, which gives rise to a correlated 
insulator at half-filling that has characteristics consistent with Mott-like 
insulator behaviour'®. The doping density that is required to reach the 
Mott-like insulating states is n,/2 ~ (1.2-1.6) x 10'*cm~’, depending 
on the exact twist angle. Here we report transport data that clearly 
demonstrate that superconductivity is achieved as the material is doped 
slightly away from the Mott-like insulating state in magic-angle TBG. 
We observed superconductivity across multiple devices with slightly 
different twist angles, with the highest critical temperature that we 
achieved being 1.7 K. 


Superconductivity in magic-angle TBG 

In Fig. 1a we show the typical device structure of fully encapsulated 
TBG devices. The two sheets of graphene originate from the same 
exfoliated flake, which permits a relative twist angle that is controlled 
precisely to within about 0.1°-0.2° (refs 17, 20, 21). The encapsulated 
TBG stack is etched into a ‘Hall bar and contacted from the edges’. 
Electrical contacts are made from non-superconducting materials 
(thermally evaporated Au on a Cr sticking layer) to avoid any potential 
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Figure 1 | Two-dimensional superconductivity in a graphene 
superlattice. a, Schematic of a typical twisted bilayer graphene 

(TBG) device and the four-probe (V,,, Vg, J and the bias voltage Vyas) 
measurement scheme. The stack consists of hexagonal boron nitride 

on the top and bottom, with two graphene bilayers (G1, G2) twisted 
relative to each other in between. The electron density is tuned by a 
metal gate beneath the bottom hexagonal boron nitride layer. b, Four- 
probe resistance Ry = Vyx/I (Vx and I are defined in a) measured in two 
devices M1 and M2, which have twist angles of = 1.16° and 0= 1.05°, 
respectively. The inset shows an optical image of device M1, including the 
main ‘Hall bar (dark brown), electrical contact (gold), back gate (light 
green) and SiO,/Si substrate (dark grey). c, The band energy E of TBG 
at = 1.05° in the first mini Brillouin zone of the superlattice. The 
bands near charge neutrality (E = 0) have energies of less than 15 meV. 
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d, The DOS corresponding to the bands shown in c, for energies of 

—10 to +10 meV (blue; 0 = 1.05°). For comparison, the purple lines show 
the total DOS of two sheets of freestanding graphene without interlayer 
interaction (multiplied by 10°). The red dashed line shows the Fermi 
energy E at half-filling of the lower branch (E < 0) of the flat bands, 
which corresponds to a density of n = —n,/2, where n, is the superlattice 
density (defined in the main text). The superconductivity is observed 
near this half-filled state. e, Current—voltage (V,.—I) curves for device 
M2 measured at n = —1.44 x 10!2cm~? and various temperatures. At 

the lowest temperature of 70 mK, the curves indicate a critical current 

of approximately 50 nA. The inset shows the same data on a logarithmic 
scale, which is typically used to extract the Berezinskii-Kosterlitz- 
Thouless transition temperature (Tgxr = 1.0 K in this case), by fitting to a 
Vy oc P power law (blue dashed line). 
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proximity effects. The carrier density n is tuned by applying a voltage 
to a Pd/Au bottom gate electrode. In Fig. 1b we show the longitudi- 
nal resistance R,, as a function of temperature for two magic-angle 
devices, M1 and M2, with twist angles of 1.16° and 1.05°, respectively. 
At the lowest temperature studied of 70 mK, both devices show zero 
resistance, and therefore a superconducting state. The critical temper- 
ature T. as calculated using a resistance of 50% of the ‘normal’-state 
(non-superconducting) value is approximately 1.7 K and 0.5K for the 
two devices that we studied in detail. In Fig. 1c, d we show a single- 
particle band structure and density of states (DOS) near the charge 
neutrality point calculated for = 1.05°. The superconductivity in both 
devices occurs when the Fermi energy Ef is tuned away from charge 
neutrality (Ep =0) to be near half-filling of the lower flat band (Ep <0, 
as indicated in Fig. 1d). The DOS within the energy scale of the flat 
bands is more than three orders of magnitudes higher than that of 
two uncoupled graphene sheets, owing to the reduction of the Fermi 
velocity and the increase in localization that occurs near the magic 
angle. However, the energy at which the DOS peaks does not gener- 
ally coincide with the density that is required to half-fill the bands. 
In addition, we did not observe any appreciable superconductivity 
when the Fermi energy was tuned into the flat conduction bands 
(Ep > 0). In Fig. le we show the current-voltage (I-Vxx, where Vxx 
is the four-probe voltage, as defined in Fig. 1a) curves of device M2 
at different temperatures. We observe typical behaviour for a two- 
dimensional superconductor. The inset shows a tentative fit of the 
same data to a Vyx x I? power law, as is predicted in a Berezinskii- 
Kosterlitz-Thouless transition in two-dimensional superconductors”. 
This analysis yields a Berezinskii-Kosterlitz-Thouless transition tem- 
perature of Tgxr* 1.0K at n=—1.44 x 10’? cm ~?, where, as before, 
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nis the carrier density induced by the gate and measured from the 
charge neutrality point (which is different from the actual carrier 
density involved in transport, as we show below). 

In contrast to other known two-dimensional and layered super- 
conductors, the superconductivity in magic-angle TBG requires the 
application of only a small gate voltage, corresponding to a minimal 
density of only 1.2 x 10'?cm~ from charge neutrality, an order of mag- 
nitude lower than the value of 1.5 x 10’ cm~? in LaAlO3/SrTiO;  inter- 
faces and of 7 x 10'?cm~” in electrochemically doped MoS), among 
others**. Therefore, gate-tunable superconductivity can be realized 
in a high-mobility system without the need for ionic-liquid gating or 
chemical doping. In Fig. 2a we show the two-probe conductance of 
device M1 versus n at zero magnetic field and at a 0.4-T perpendic- 
ular magnetic field. Near the charge neutrality point (n=0), a typical 
V-shaped conductance is observed, which originates from the renor- 
malized Dirac cones of the TBG band structure. The insulating states 
centred at approximately -++3.2 x 10!*cm~* (which corresponds to n, 
for = 1.16°) are due to single-particle bandgaps in the band structure 
that correspond to filling +4 electrons in each superlattice unit cell. In 
between, there are conductance minima at +2 and +3 electrons per 
unit cell. These minima are associated with many-body gaps induced by 
the competition between the Coulomb energy and the reduced kinetic 
energy due to confinement of the electronic state in the superlattice 
near the magic angle; these gaps give rise to insulating behaviour near 
the integer fillings'*. One possible mechanism for the gaps is similar 
to the gap mechanism in Mott insulators, but with an extra two-fold 
degeneracy (for the case of +2 electrons) from the valleys in the origi- 
nal graphene Brillouin zone!”!*>°, In the vicinity of —2 electrons 
per unit cell (n+ —1.3 x 10cm * ton—1.9 x 10cm”) and ata 
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Figure 2 | Gate-tunable superconductivity in magic-angle TBG. 

a, Two-probe conductance G2 = I/Vpias of device M1 (9 = 1.16°) measured 
in zero magnetic field (red) and at a perpendicular field of B; =0.4 T 
(blue). The curves exhibit the typical V-shaped conductance near charge 
neutrality (n = 0, vertical purple dotted line) and insulating states at the 
superlattice bandgaps n = +n,, which correspond to filling +4 electrons 
in each moiré unit cell (blue and red bars). They also exhibit reduced 
conductance at intermediate integer fillings of the superlattice owing to 
Coulomb interactions (other coloured bars). Near a filling of —2 electrons 
per unit cell, there is considerable conductance enhancement at zero field 
that is suppressed in B, = 0.4 T. This enhancement signals the onset of 
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superconductivity. Measurements were conducted at 70 mK; Vpias = 10 |1V. 
b, Four-probe resistance R,,, measured at densities corresponding to 

the region bounded by pink dashed lines in a, versus temperature. Two 
superconducting domes are observed next to the half-filling state, which 

is labelled ‘Mott’ and centred around —n,/2 = —1.58 x 10!cm~?. The 
remaining regions in the diagram are labelled as ‘metal’ owing to the 
metallic temperature dependence. The highest critical temperature 
observed in device M1 is T-= 0.5 K (at 50% of the normal-state resistance). 
c, As in b, but for device M2, showing two asymmetric and overlapping 
domes. The highest critical temperature in this device is T:-= 1.7 K. 
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Figure 3 | Magnetic-field response of the superconducting states 
in magic-angle TBG. a, b, Four-probe resistance as a function of 
density n and perpendicular magnetic field B, in devices M1 (a) and 
M2 (b). As well as the dome structures around half-filling (similar to 
those in Fig. 2b, c), there are oscillatory features near the boundary 
between the superconducting phase and the correlated insulator phase. 
These oscillations are indicative of phase-coherent transport through 
inhomogeneous regions in the device (Methods, Extended Data Fig. 1). 


temperature of 70 mK, the conductance is substantially higher at zero 
magnetic field than it is in a perpendicular magnetic field of B, =0.4 T, 
consistent with mean-field suppression of a superconducting state by 
the magnetic field. Here, the maximum conductance is limited only 
by the contact resistance (Fig. 2a), which is absent in the four-probe 
measurements shown in the other figures. 

In Fig. 2b, c we show the four-probe resistance of devices M1 and 
M2, respectively, as a function of density n and temperature T. Both 
devices show two pronounced superconducting domes on each side of 
the half-filling correlated insulating state. These features are similar to 
those associated with high-temperature superconductivity in cuprate 
materials. At the base temperature, the resistance inside the domes is 
lower than our measurement noise floor, which is more than two and 
three orders of magnitude lower than the normal-state resistance for 
devices M1 and M2, respectively. The I-V curves inside the domes 
display critical current behaviour (Fig. le), while being ohmic in the 
metallic phases outside the domes. Upon cooling while vis fixed at 
the middle of the half-filling state, the correlated insulating phase is 
exhibited at intermediate temperatures (from 1 K to 4K); at lower tem- 
peratures, both devices exhibit signs of superconductivity at the lowest 
temperatures. Device M1 becomes weakly superconducting, whereas 
device M2 becomes fully superconducting. This behaviour may be 
explained by a coexistence of superconducting and insulating phases 
due to sample inhomogeneity. 


Magnetic-field response 

The application of a perpendicular magnetic field B, to a two- 
dimensional superconductor creates vortices that introduce dissipation 
and gradually suppress superconductivity”’. In Fig. 3a, b we show the 
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c, Differential resistance dV,,,/dI versus d.c. bias current J for different B,, 
measured for device M2. d, R,,-T curves for different B, , measured for 
device M1. e, Perpendicular (B.,) and parallel (B.) critical magnetic field 
versus temperature for device M1 (triangles; at 50% of the normal-state 
resistance). The fitting curves for B., correspond to Ginzburg-Landau 
theory for a two-dimensional superconductor. B,\ is fitted to 
B.y(0)(1—T/T.) "7 , where B,1(0) is the parallel critical field at zero 
temperature. Measurements in a—c were conducted at 70 mK. 


resistance of devices M1 and M2 as a function of density and B,. Both 
devices exhibit a maximum critical field of approximately 70 mT. The 
critical field varies strongly with doping density, showing two similar 
domes on each side of the half-filling state. Near the Mott-like insu- 
lating state (n + —1.47 x 10!2cm~? ton —1.67 x 10'*cm~? for M1; 
n&—1.25 x 10"%cm? to n&—1.35 x 10cm ? for M2), periodic 
oscillations of the resistance and critical current as a function of B, 
appear (see Methods and Extended Data Fig. 1 for detailed analysis). 
The oscillations seem to originate from phase-coherent transport 
through arrays of Josephson junctions, similarly to superconducting 
quantum interference device (SQUID)-like superconductor rings 
around one or more insulating islands. These junction regions could 
be due to slight density inhomogeneities in the devices, which would 
cause a few islands to be doped into the insulating phase while other 
parts of the device remain superconducting. Apart from these oscilla- 
tory behaviours near the boundary of the half-filling insulating state, 
the critical current and zero resistivity inside the domes are gradually 
suppressed by B, (Fig. 3c, d). 

In Fig. 3e we show the critical magnetic field versus temperature for 
device M1, under perpendicular and parallel field configurations. The 
temperature dependence of the perpendicular critical field B., is well 
described by Ginzburg-Landau theory: B., = [®o/(2né2,)](1 — T/T.) 
where &p = h/(2e) is the superconducting flux quantum, h is the Planck 
constant, and €g, is the Ginzburg-Landau superconducting coherence 
length, determined from the fit to be &g, ~52nm at T=0. On the 
other hand, the in-plane critical field dependence is not well explained 
by the Ginzburg-Landau theory for thin-film superconductors, owing 
to the atomic thickness of TBG (0.6 nm); at this thickness, the theory 
predicts an in-plane critical field of B., > 36 T as the temperature 
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Figure 4 | Temperature-density phase diagrams of magic-angle TBG 
at different magnetic fields. a—c, R,,-T curves for device M1 at different 
densities (see legend), measured in B, =0 T (a), B| = 0.4 T (b) and 

B, =8T (c). The magnetic field induces a superconductor-insulator- 


approaches zero’. Instead, we interpret the dependence of T. on the 
in-plane magnetic field B, as a result of paramagnetic pair-breaking 
owing to the Zeeman energy. The zero-temperature in-plane critical 
field is extrapolated to be around 1.1 T, which is higher than but close 
to the value in the Pauli limit of By 1.85 T K"! x T.~0.93 T, estimated 
on the basis of the Bardeen—Cooper-Schrieffer (BCS) gap formula 
Aw 1.76kgT., where kg is the Boltzmann constant. 

We note that the superconductor—metal transition in magic-angle 
TBG is not sharp, so extracting both B, and T- has some uncertainty. 
Qualitatively, the dependence of the in-plane critical field on tempera- 
ture is B,, x (1 — T/T.)"? near T. (ref. 27). The results described above 
are consistent with the existence of two-dimensional superconductivity 
confined in an atomically thin space. As we show in the following, 
the coherence length € is comparable to the inter-particle spacing and 
might suggest that the system is driven close to a crossover between 
a BCS-like state and a Bose-Einstein condensate (the BCS-BEC 
crossover). 


Phase diagram of magic-angle TBG 

The phase diagram of magic-angle TBG consists of correlated insu- 
lator phases and superconducting phases, which can be realized via 
continuous tuning of temperature, magnetic field and carrier density. 
Similarly to the superconducting domes discussed above, the correlated 
Mott-like insulator phase at half-filling also assumes a dome shape, 
with a transition to a metallic phase at about 4-6 K and centred around 
half-filling density. It has been shown'* that the Mott-like insulator 
phase crosses over to a metallic phase upon application ofa strong mag- 
netic field of around 6 T either perpendicular or parallel to the devices. 
A plausible explanation for this crossover is that the many-body charge 
gap is closed by the Zeeman energy. 

In Fig. 4a—c we show the resistance versus temperature data meas- 
ured in device M1 at zero magnetic field, B, =0.4 T and B, =8T, 
respectively. At zero field, we observe the transition from a metal at high 
temperatures (above 5 K) to a superconductor. Close to half-filling there 
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metal transition at the lowest temperatures. d—f, Schematic phase diagrams 
for the magnetic fields in a~c. The horizontal axis shows the relative filling 
n/n,. Short coloured lines at the top and bottom of the plots denote the 
densities plotted in a-c. 


is an intermediate region in which insulating temperature dependence 
is observed from about 1 K to 4K (above T.); we identify this region 
as corresponding to the Mott-like insulating phase at half-filling. In a 
small magnetic field B, =0.4 T, which is above the critical magnetic 
field, the system remains an insulator down to zero temperature near 
half-filling and a metal away from half-filling. Finally, in a strong mag- 
netic field B, =8 T, the correlated insulator phase is fully suppressed 
by the Zeeman effect and the system is metallic everywhere between 
n=~—n, and the charge neutrality point. Our data highlight the rich 
phase space of metal-insulator-superconducting physics in magic- 
angle TBG**. A schematic of the evolution of the phase diagram as the 
magnetic field increases is shown in Fig. 4d-f. 


Quantum oscillations in the normal state 

We studied quantum oscillations in the entire accessible density range, 
including in the vicinity of the correlated insulating state at which 
superconductivity occurs. In Fig. 5a, b we show the Shubnikov—de 
Haas oscillations in longitudinal resistance R, as a function of carrier 
density for the hole-doped region (Ep < 0) for device M2. The Landau 
levels in a TBG superlattice typically follow n/n,=N@/¢@o + s, where 
@=B A is the magnetic flux that penetrates each unit cell, 69 =h/e 
is the (non-superconducting) flux quantum, N= +1, +2, +3, ... is 
the Landau-level index, s=0 denotes the Landau fan that emanates 
from the Dirac point, and s= +1 denote the Landau fans that result 
from electron-like or hole-like quasiparticles near the band edges of 
the single-particle superlattice bands in the mini Brillouin zone, which 
emanate from +n,. The Landau levels also exhibit a four-fold degen- 
eracy due to spins and valleys, and so the filling-factor sequence is 
4, +8, +12,... 

Unexpectedly, in addition to these expected Landau fans, we also 
observe a Landau fan that emanates from the correlated insulating 
state at —n,/2. This Landau fan has N= —1/2, —1, —3/2, —2, ... (that 
is, filling factors of —2, —4, —6, —8, ...) and s=—1/2. The supercon- 
ducting dome is distinguishable in Fig. 5a directly beneath this Landau 
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Figure 5 | Quantum oscillations in magic-angle TBG at high fields. 

a, Resistance R,, versus density n (hole-doped side with respect to charge 
neutrality) and B, in device M2. The lower half of the diagram shows the 
Landau-level structure deduced from the oscillations. The blue Landau fan, 
which originates from the charge neutrality point (CNP), and the purple 
Landau fan, which originates from the superlattice density (n = -n,, yellow 
shaded region), illustrate the filling-factor sequences —4, —8, —12, ... 
expected from the single-particle band structure with four-fold spin and 
valley degeneracies. The additional red fan, which originates from —n,/2 
(red shaded region), instead has a filling-factor sequence of —2, —4, —6,... 


fan, being very close to zero field and next to the correlated insulating 
region. Unlike commonly observed broken-symmetry states that split 
from a single degenerate Landau level into multiple levels, the halved 
filling factors appear to be intrinsic to the fan, holding down to the 
lowest magnetic field at which oscillations are still visible. Fractional 
values for s have been reported in graphene superlattices as a result 
of Hofstadter’s butterfly, which typically occurs in much stronger 
magnetic fields (greater than 10 T) but becomes obvious only at the 
intersection of Landau levels with different integer s (refs 29-31). 
Therefore, the physics of Hofstadter’s butterfly cannot explain the addi- 
tional stand-alone fan observed here, which appears at fields as low 
as 1 T. Furthermore, the halving of the filling factors and s is unlikely 
to be explained in a non-interacting picture of unit-cell doubling due 
to strain or to the formation of a charge density wave, in which case 
either spin or valley degeneracy must be broken. We observed the same 
Landau level sequence in two other magic-angle TBG devices, so it 
is robust against small variations in twist angle and consistent across 
samples (Methods, Extended Data Fig. 2). 

To study the non-trivial origin of the Landau fan near half-filling fur- 
ther, we measured the effective mass from the temperature-dependent 
quantum oscillation amplitude according to the Lifshitz—Kosevich 
formula (Methods). In Fig. 5b, c we show the oscillations and oscil- 
lation amplitudes at three different densities (indicated by arrows in 
Fig. 5a). In Fig. 5d, e we show the oscillation frequency fgay and the 
effective mass extracted by fitting the oscillation amplitudes to the 
Lifshitz—Kosevich formula. The dependence of fsqy on carrier density 
n provides another perspective on the oscillations because the value 
of M= @pAn/Afsay extracted from the slope An/ Afgan provides the 
number of degenerate Fermi pockets M directly. The experimental data 
clearly fit to M=4 near the charge neutrality point and for densities 
beyond the superlattice gap, whereas M =2 for the quantum oscilla- 
tions that start near the correlated insulator state and right above the 
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that is not expected from the single-particle band structure. b, Temperature- 
dependent quantum oscillation traces AR,,,/R,(B=1 T) at the carrier 
densities labelled A, B and C in a. From black to orange, the temperatures 
are 0.7 K, 1.2K, 2.0K, 3.0K, 4.2K, 6K, 10K, 15K, 20K and 30K. ¢, Lifshitz 
Kosevich fit (solid lines) of the normalized amplitudes of the oscillations 
shown in b (data points). d, e, Shubnikov—de Haas oscillation frequencies 
fsan and effective masses m*/m, as a function of carrier density n. The 

error bars correspond to the 90% confidence level in fitting to the Lifshitz- 
Kosevich formula (see Methods for definition). M= ¢)An/Afgay is the 
Fermi surface degeneracy. 


superconducting dome. The effective mass of the anomalous oscilla- 
tions is about (0.2—-0.4)m,., where m, is the bare electron mass. This 
mass is much larger than the mass near charge neutrality (about 0.1m.) 
and beyond the superlattice gap (about 0.05m,) at the same An, where 
An is density relative to the value of n at which fsaH = 0 in Fig. 5d. 

The quantum oscillations above the superconducting dome clearly 
indicate the existence of small Fermi surfaces that originate from 
the correlated insulating state, which have areas proportional to 
n' =|n| — n,/2, rather than of a large Fermi surface with an area that 
corresponds to the density |n| itself. The Hall measurements shown 
in Extended Data Fig. 3 also support this conclusion. Notably, similar 
small Fermi pockets that do not correspond to any pockets in the 
single-particle Fermi surface have been observed in underdoped 
cuprates, although their origin is debated**-**, Among the possibilities, 
the small Fermi surface that we observe could be the Fermi surface 
of quasiparticles that are created by doping a Mott insulator®**. On 
the other hand, the halved degeneracy might be related to spin- 
charge separation, as predicted in a doped Mott insulator*>. More 
experimental and theoretical work is needed to clarify the origin of 
the quantum oscillations. 


Discussion 

The appearance of both superconductor and correlated insulator 
phases in the flat bands of magic-angle TBG at such a small carrier 
density cannot be explained by weak-coupling BCS theory. The 
carrier density that is responsible for T- = 1.7 K is extremely small 
according to the quantum oscillation measurements, merely 
n'=1.5 x 10''cm~ at optimal doping. To place this in the context 
of other superconductors, in Fig. 6 we plot T- against Ty on a logari- 
thmic scale for various materials, where Ty is the Fermi temperature. 
Tr is proportional to the two-dimensional carrier density nzp, which 
the quantum oscillations data show to be equivalent to n’ for the 
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Figure 6 | Superconductivity in the strong-coupling limit. Logarithmic 
plot of critical temperature T- versus Fermi temperature Ty for various 
superconductors*®. The top axis is the corresponding two-dimensional 
carrier density np for two-dimensional materials or n3/? for three- 
dimensional materials, normalized by the effective mass m”/m, and the 
Fermi surface degeneracy g (and a constant factor of 1/1.52 for the three- 
dimensional density). Two-dimensional superconductors are represented 
by filled circles; other symbols represent three-dimensional (but 
potentially two-dimensional-like) superconductors. For comparison, we 
also plot Tgrc = 1.04hn3p/m* for a three-dimensional bosonic gas (dashed 
line). Bose-Einstein condensation temperatures in “He, paired fermionic 
40K and paired fermionic °Li are shown as open pink squares*°4 (T; and Ty 
have both been multiplied by 108 for *°K and °Li). The point for magic- 


superconducting dome region of magic-angle TBG**. Most uncon- 
ventional superconductors have T./Ty values of about 0.01-0.05, 
whereas all of the conventional BCS superconductors lie on the 
far right in the plot, with much smaller ratios. Magic-angle TBG is 
located above the trend line on which most cuprates, heavy-fermion 
and organic superconductors lie, with a T./Tp value approaching that 
of the recently observed exotic FeSe monolayer on SrTiO; (Fig. 6 
inset). This finding strongly suggests that the superconductivity in 
magic-angle TBG originates from electron correlations instead of 
weak electron-phonon coupling. One other frequently compared 
temperature is the Bose-Einstein condensation temperature for a 
three-dimensional boson gas Tpgc, assuming that all particles in the 
occupied Fermi sea pair up and condense. Cuprates and other uncon- 
ventional superconductors typically have T-/Tpgc ratios of roughly 
0.1-0.2. The T./Tgrc ratio for magic-angle TBG is estimated to be 
up to 0.37, indicating very strong electron-electron interactions and 
possibly close proximity to the BCS-BEC crossover. This behaviour 
is in agreement with the fact that the coherence length in magic-angle 
TBG (€~50nm at optimal doping) is of the same order of magnitude 
as the average inter-particle distance, (n')~!? ~26nm. 

The realization of unconventional superconductivity in a graphene 
superlattice establishes magic-angle TBG as a relatively simple, clean, 
accessible and, most importantly, highly tunable material, which could 
be used to study correlated electron physics. The interactions in magic- 
angle TBG could possibly be further fine-tuned by the twist angle and 
by the application of perpendicular electric fields by means of differ- 
ential gating'*°”. Moreover, T- could possibly be enhanced further by 
applying pressure to the graphene superlattice to increase the interlayer 


angle TBG (large red filled circle) is calculated from the two-dimensional 
density and the effective mass obtained from quantum oscillations 

(Fig. 5d, e) at the optimal doping (2p = 1.5 x 10!!cm~* and m* =0.2m.), 
using g= 1 to account for the halved degeneracy. Data for other materials 
are from refs 36,45-54. The blue shaded region is the approximate region 
in which almost all known unconventional superconductors lie. The inset 
shows the variation in T./T as a function of doping n’ for magic-angle 
TBG (red filled circles). The horizontal dashed lines are the approximate 
T./ Tz values of the corresponding material. YBCO, YBazCu307_»5; LSCO, 
Laz_,Sr,CuO4; BSCCO, BiSrzCa,Cu3O,; LAO, LaAlO3; STO, SrTiOs; 

1L, single layer; EDLT, electric double-layer transistor; BEDT, 
bisethylenedithiol; TMTSE, tetramethyltetraselenafulvalene. 


hybridization or by coupling different magic-angle TBG structures to 
induce Josephson coupling in the vertical direction**®. Similar magic- 
angle superlattices and flat-band electronic structures could also be 
realized with other two-dimensional materials or lattices to investigate 
strongly correlated systems with different properties. 

Finally, despite several apparent similarities between magic-angle 
TBG and cuprates, there are key differences between the realizations 
of them. First, the valley degree of freedom in the underlying graphene 
lattices leads to an extra degeneracy, resulting in two carriers per 
superlattice unit cell at half-filling in the parent correlated insulator 
state. Higher quality devices and fine tuning may lead to supercon- 
ductivity near the regions corresponding to one and three carriers per 
unit cell. Second, in magic-angle TBG the underlying superlattice is 
triangular, which should have a fundamental influence on the type of 
spin-singlet ground state it can host, owing to magnetic frustration. 
The lattice symmetry should also impose limitations on the possible 
superconducting pairing symmetry in magic-angle TBG; further 
experiments, for example, involving tunnelling and Josephson hetero- 
junctions, are required to confirm this*’. Various pairing symmetries, 
including (d + id’)-wave, (py + ipy)-wave and spin-triplet s-wave 
symmetries, have been predicted theoretically in the hypothetical 
superconductivity of monolayer or few-layer graphene“. If the 
mechanism for superconductivity in magic-angle TBG is indeed 
related to the correlated half-filling insulating state, as is the case in 
d,.2_,2-wave cuprates, then the pairing symmetry might be chiral 
(d + id’)-wave, to satisfy the underlying triangular symmetry of the 
superlattice. We anticipate that further experimental and theoretical 
work on magic-angle TBG and related magic-angle superlattices will 
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provide insights into the key factors that govern unconventional super- 
conductivity, and bring us closer to realizing tunable quantum spin 
liquids®’. 
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METHODS 


Sample preparation. The devices were fabricated using a modified dry-transfer 
technique’”'®?°. Monolayer graphene and hexagonal boron nitride (about 
10-30 nm thick) were exfoliated on SiO2/Si chips and high-quality flakes were 
picked using optical microscopy and atomic force microscopy. We used a 
poly(bisphenol A carbonate) (PC)/polydimethylsiloxane (PDMS) stack on a glass 
slide mounted on a custom-made micro-positioning stage to pick up a hexagonal 
boron nitride flake at 90°C, and then used the van der Waals force between hexa- 
gonal boron nitride and graphene to tear a graphene flake at room temperature. 
The separated graphene pieces were rotated manually by a twist angle of about 
1.2°-1.3° and stacked together again, which resulted in a controlled TBG 
structure. The stack was encapsulated with another hexagonal boron nitride flake 
on the bottom and released onto a metal gate at 160°C. We did not perform any 
heat annealing after this step because we found that TBG tended to relax to Bernal- 
stacked bilayer graphene at high temperatures. The final device geometry was 
defined by using electron-beam lithography and reactive ion etching. Electrical 
connections were made to the TBG by Cr/Au edge-contacted leads”. 
Measurements. Transport measurements were performed in a dilution refrigerator 
with a base temperature of 70 mK, except for the temperature-dependent quantum 
oscillations, which were measured in a *He fridge. 

We used standard low-frequency lock-in techniques with an excitation fre- 
quency of about 5-10 Hz and an excitation current of about 0.4-5 nA. The current 
flowing through the sample was amplified by a current pre-amplifier and measured 
by the lock-in amplifier. The four-probe voltage was amplified by a voltage pre- 
amplifier at x 1,000 gain and measured by another lock-in amplifier. 

The twist angle of the devices was determined from the transport measurements 

at low temperatures". In brief, a rough estimate of the twist angle is provided by the 
carrier density of the superlattice gaps at -tn,, which present as strongly insulating 
states. To refine this estimate, the Landau levels that appear at high magnetic fields 
were fitted to the Wannier diagram, which gives the twist angle with an uncertainty 
of about 0.01°-0.02°. 
Extracting the quantum oscillation frequency and effective mass. The effective 
mass in device M2 was extracted using the standard Lifshitz-Kosevich formula, 
which relates the temperature-dependence of resistance change AR,,(T) to the 
cyclotron mass m* (at a given magnetic field B ): 


x _ 2n*kpTm* 
> heB, 


AR,(T) x = 
sinh(y) 


For each gate voltage (carrier density), we measured the R,,-B, curves at different 
temperatures, normalized them by their low-field values and subtracted a common 
polynomial background in B,. Examples of the curves are shown in Fig. 5b. The 
oscillation frequencies shown in Fig. 5d were extracted from these curves plotted 
versus 1/B,. From the temperature-dependent amplitude of the most prominent 
peak, we extracted m* using the above equation (Fig. 5e). The error bars in Fig. 5d, 
e represent 90% confidence intervals of the fit. 

Commensuration and twist angle. Mathematically, in a twisted moiré system, 
the lattice is strictly periodic only when the twist angle satisfies a specific relation 
such that lattice registration order is perfectly recovered in a finite distance. These 
special cases are termed ‘commensurate’ structures. One important parameter in 
commensurate TBG structures is r, which can be intuitively understood as the 
number of ‘apparent’ moiré pattern wavelengths that it takes to recover the lattice 
periodicity fully'!*°°. The simplest commensurate structures with r= 1 are called 
‘minimal structures. These structures have exactly one moiré spot per unit cell. In 
TBG, as well as the minimal structures, which occur only at discrete angles, there 
are other commensurate structures that are arbitrarily close to any given angle 0 
with large r. However, at small twist angles, the evolution of the band structure of 
TBG can be viewed as semi-continuous; that is, an infinitesimal change in twist 
angle does not have a substantial effect on the band structure even though the 
lattice could be in a different family of commensurate structures (different r)!”. 
In other words, the TBG system can be well approximated by a continuum 
model, as originally proposed in ref. 12, and the physics in minimal structures is 
representative of all nearby commensurate structures”. In our experiments, we 
do not expect the lattice to be in perfect commensuration, owing to disorder and 
intrinsic randomness due to the fabrication process. However, we think that the 
continuum model can faithfully represent the realistic TBG system in which any 
commensuration effect has been smoothed out. 

We deduced the size of the moiré unit cell and the twist angle on the basis of the 
density of the superlattice gaps +n, (+4 electrons per moiré unit cell), and then 
cross-checked the twist angle with the Landau levels observed at high magnetic 
fields. +n, are the only multiples of n, that correspond to Fermi energies located 
within single-particle band gaps and therefore exhibit strong insulating behaviour. 
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For twist angles above about 0.9°-1°, the band structure at energies higher than 
these gaps is strongly overlapping and no single particle gaps at +2n,, +3n,, ... 
appear!*!+195°. The experimentally measured values for the single-particle 
insulating gaps that we observe are in the approximately 30-60-meV range!”!8. 
However, below about 0.9°-1°, the superlattice gaps at +n, close and there is no 
single-particle gap at any energy in the system*!”®. In this regime, there are Dirac- 
like bands that cross at +2n, which might be responsible for the resistance peaks 
observed in devices with very small twist angles, although possible interaction 
effects may enhance these peaks”!. The states observed in very-low-twist devices 
are clearly different from the strong insulating gaps observed here and previously!®. 
There is a marked change in the band structure at about 0.9°-1° (depending on 
the parameters of the model being used), which leads to a transition from single- 
particle gaps at +n, to resistive states at -+2n;. This crossover can be observed 
clearly in Supplementary Video, in which we show an evolution of the band struc- 
ture of TBG from @= 3° to = 0.8°. The data in the video were calculated using 
the continuum model!” 

Possible effects due to finite electrical fields. It has been shown that by applying a 
perpendicular electrical field to Bernal-stacked bilayer graphene, topological states 
can emerge on the AB/BA stacking boundaries while the bulk of the AB and BA 
regions remains gapped*”~”. In small-angle TBG, a similar effect can alter the band 
structure because the AA-stacked regions in the moiré pattern are interconnected 
by the AB/BA stacking boundaries. This effect has been observed recently in scan- 
ning tunnelling experiments on ultrasmall-twist-angle samples”. 

The question then arises of how the flat bands in magic-angle TBG are affected 

by the network of topological boundaries when a residual electrical field is present. 
Theoretical work on 6= 1.5° TBG has shown that when an inter-layer potential 
difference of AV= 300 mV is applied the low-energy superlattice bands become 
even flatter and the electronic states become more localized*”. Therefore, there is 
good reason to believe that the flat-band physics presented here holds even when 
a perpendicular electric field is present, because the electric field will probably 
render the band structure even more localized and correlated as the twist angle 
approaches the magic angle. In our experiments, we estimate that the potential 
difference between the two layers induced by our gate voltage is at most about 
50 mV, and probably much less, owing to screening. Any possible effects of the 
residual electric field should be minimal. 
Phase-coherent transport behaviour in superconducting magic-angle TBG. In 
Fig. 3a, b we observe oscillatory behaviour in the measured longitudinal resistance 
Ryx as a function of perpendicular magnetic field B; when the charge density is 
close to the boundary between the half-filling insulating state and the supercon- 
ducting states. The oscillations are most clearly seen for 1 —1.70 x 10cm ~? 
ton —1.60 x 10cm? and n= —1.50 x 10cm? to n& —1.47 x 10% cm? 
in device M1. 

In Extended Data Fig. 1a, b we show the differential resistance dV,./dI 
versus bias current J and perpendicular magnetic field B,. At zero bias current, 
the oscillations of the differential resistance with B, shown correspond to line 
cuts in Fig. 3a at densities of n= —1.48 x 10’’ cm? (Extended Data Fig. 1a) and 
n= —1.68 x 10!2cm~? (Extended Data Fig. 1b). The critical current, above which 
the superconductor becomes normal, oscillates with B, at the same frequency, as 
can be visualized by the bright peaks in Extended Data Fig. 1a, b. The oscillation 
period is AB=22.5 mT in Extended Data Fig. 1a and about AB=4 mT in Extended 
Data Fig. 1b. 

The fact that the critical current is maximum at zero B, and oscillates at peri- 
odic intervals of the magnetic field suggests the existence of Josephson junction 
arrays—in the simplest case, a superconducting quantum interference device 
(SQUID)-like superconducting loop, around a normal or insulating island”. It is 
unclear whether this inhomogeneous behaviour is a result of sample disorder or 
a coexistence of two different phases (such as the superconducting phase and the 
correlated insulator phase). Owing to the two-dimensional nature of our devices, 
the detailed current distribution in the device cannot be uniquely determined at 
this moment by transport measurements; however, from the oscillation period we 
deduce the effective loop area of the SQUID approximately using $= @o/AB, where 
y= h/(2e) is the superconducting quantum flux. (Note the difference between 
oo=h/e for the quantum Hall effect and &p = h/(2e) for superconductivity.) For 
the experimental data in Extended Data Fig. 1a, b, we obtain areas of S=0.09 jum? 
and S=0.5 1m’, respectively. By comparison, the total device area between the 
voltage probes is approximately 1 um. 

Using a simple model of a SQUID with a phenomenological decay of the oscil- 
lation amplitude at higher magnetic fields, we attempt to reproduce the observed 
oscillations qualitatively using numerical simulations. In Extended Data Fig. 1c we 
show the simulated I-B, map of the differential resistance for a SQUID with area 
S$=0.09 m2, with the same critical current I-y = I~. =7nA in the two branches, 
corresponding to the experimental data in Extended Data Fig. 1a. In Extended Data 
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Fig. 1d we show the simulation for an asymmetric SQUID with area S=0.5 1m? 
and critical currents of J; =6nA and I..=10nA for the two branches, which 
account for the partial cancellation of the critical current at low fields (that is, 
the total critical current does not reach zero in an oscillation) seen in Extended 
Data Fig. 1b. These simulations provide a qualitative perspective on the oscillatory 
phenomenon; the actual supercurrent distribution is probably much more com- 
plex and will need to be established via magnetic imaging techniques. However, 
our data indicate that the superconducting behaviour that we observe is indeed 
a phase-coherent phenomenon. Although we did not fabricate SQUID devices 
deliberately using magic-angle TBG, these periodic oscillations of the critical cur- 
rent in B, are probably a result of the Josephson effect through a superconductor 
with insulating puddles, further confirming the existence of superconductivity in 
magic-angle TBG. 

Induced superconductivity in graphene and graphene-based systems through 
proximity to another superconductor has been demonstrated, and graphene-based 
Josephson junctions continue to be explored®!. Superconductivity in graphene 
induced by proximity to a high-T. superconductor has been reported recently, and 
indications of induced unconventional pairing have been observed*®. 
Supplementary quantum oscillation data and low-field Hall effect. In Extended 
Data Fig. 2 we show magneto-transport data for device M1 and another magic- 
angle device D1. Both devices show evidence for the existence of an extra Landau 
fan with a degeneracy of M=2 that emerges from the half-filling insulating states. 
All of the magic-angle devices that we have measured so far display quantum oscil- 
lations that correspond to emergent quasiparticles on one side of the half-filling 
states—the one that is away from the charge neutrality point (that is, n << —n,/2 
for Ep<0and n> n,/2 for Ez > 0; see ref. 18 for the Ez > 0 data)—but not the other 
(n>—n,/2 or n<n,/2). The Hall measurements reported below exhibit a similar 
asymmetry around the half-filling state. This universally asymmetric behaviour, 
regardless of the twist angle, might be explained if the effective mass of the qua- 
siparticles on the side closer to charge neutrality is much larger, and therefore 
the corresponding quasiparticle has a much lower mobility, so that the quantum 
oscillations cannot be observed and their contribution to the Hall effect becomes 
negligible. Further theoretical work could potentially shed more light on the true 
nature of the many-body energy gap and the related quasiparticles. 

We determined the Fermi surface area in the magic-angle TBG devices using the 
Shubnikov-de Hass oscillation frequency in a magnetic field (Fig. 5). We find that 
oscillations emerge from the correlated insulating state at half-filling n= —n,/2, 
and the oscillation frequency indicates small Fermi pockets associated with a 
shifted density of n’ = |n| — n,/2. 

In Extended Data Fig. 3 we show another measurement of the transport carrier 
density via the low-field Hall effect measured up to +1 T. The measured 
Hall density, given by ny = — (1/e)(dRxy/ dB), 9? provides an independent 


measurement of the carrier density in the system. In both devices, at a temperature 


of 0.4K we observe that, whereas the Hall density follows the gate-induced density 
closely (m4=n) near charge neutrality and up to the half-filling insulating states 
at |n| =n,/2, it ‘resets to a much smaller value beyond |n| =n,/2. The Hall density 
beyond these points behaves as if the charge carriers that contribute to transport 
are just those added beyond |n|=n,/2, and roughly follows ny =n + n,/2 for 
n<—n,/2 and ny=n — n,/2 for n> n,/2. This behaviour is in agreement with the 
measurements of the quantum oscillation frequency shown in Fig. 5d. 

This resetting effect is quickly suppressed by raising the temperature to about 
10K. Beyond this temperature the Hall density increases monotonically towards 
the band edge. At these higher temperatures, the Hall density in the flat bands no 
longer follows ny =n. This could possibly be explained by the thermal energy kT 
being close to the bandwidth of the flat bands, in which case the Hall coefficient 
must take into consideration the contributions from carriers that are thermally 
excited into the higher-energy, highly dispersive bands, which have opposite 
polarity. By contrast, up to 30K, the Hall density measured at very high densities 
(|n| >n,) exhibits very linear behaviour according to |ny|=|n| — n, regardless of 
the temperature, which is consistent with the highly-dispersive, low-mass bands 
above and below the flat bands, as seen in Fig. Ic. 

Data availability. The data that support the findings of this study are available 
from the corresponding authors on reasonable request. 
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Extended Data Figure 1 | Evidence of phase-coherent transport in 
superconducting magic-angle TBG. a, b, Differential resistance dV/dI 
versus bias current J and perpendicular field B,, at two different charge 
densities n, corresponding to those in Fig. 3a. Periodic oscillations are 


observed in the critical current (identified approximately as the position 
of the bright peaks in dV/d1). c, d, Simulations intended to reproduce 
qualitatively the behaviour observed in a and b. 
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Extended Data Figure 2 | Supplementary quantum oscillation data. 

a, b, Quantum oscillations in device M1 (a; 6= 1.16°, data shown for R,) 
and device D1 (b; = 1.08°, data shown for the two-probe conductance 
G)). The first derivative with respect to the gate-defined charge density n 
has been taken in both cases to enhance the colour contrast. Both devices 


exhibit a Landau fan that emerges from the half-filling state —n,/2 and 
have a Landau level sequence of —2, —4, —6, —8, ..., consistent with the 
results shown in Fig. 5. By comparison, the Landau fans that start from 
charge neutrality have a sequence of —4, —8, —12,... 
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Extended Data Figure 3 | Low-field Hall effect in magic-angle TBG. 31.8 K. Coloured vertical bars correspond to densities of —n,, —n,/2, ns/2 
a, b, Low-field Hall effect for devices M1 (a) and M2 (b). The Hall density and n, for the two samples. Dashed lines are the expected Hall density if 
ny = — (1/e)(dRyy/ dB), is plotted as a function of the total charge the offset given in the corresponding formula is considered. 


density induced by the gate (n), measured at temperatures from 0.4 K to 
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The logic of single-cell projections from 


visual cortex 


Yunyun Han!*3*, Justus M. Kebschull*>*, Robert A. A. Campbell?*, Devon Cowan?, Fabia Imhof*, Anthony M. Zador°§ & 


Thomas D. Mrsic-Flogel*°s 


Neocortical areas communicate through extensive axonal projections, but the logic of information transfer remains poorly 
understood, because the projections of individual neurons have not been systematically characterized. It is not known 
whether individual neurons send projections only to single cortical areas or distribute signals across multiple targets. Here 
we determine the projection patterns of 591 individual neurons in the mouse primary visual cortex using whole-brain 
fluorescence-based axonal tracing and high-throughput DNA sequencing of genetically barcoded neurons (MAPseq). 
Projections were highly diverse and divergent, collectively targeting at least 18 cortical and subcortical areas. Most 
neurons targeted multiple cortical areas, often in non-random combinations, suggesting that sub-classes of intracortical 
projection neurons exist. Our results indicate that the dominant mode of intracortical information transfer is not based on 
‘one neuron-one target area’ mapping. Instead, signals carried by individual cortical neurons are shared across subsets 
of target areas, and thus concurrently contribute to multiple functional pathways. 


While the inputs received by a neuron drive its activity, its axonal 
projections determine its effects on other neurons. The axons of 
excitatory projection neurons that reside in layers 2 and 3 (hereafter 
2/3), and 5 and 6 of the neocortex are the main conduit by which 
signals are exchanged between cortical areas’. To date, no study has, 
to our knowledge, systematically investigated the principles by which 
individual neurons in any region of the mammalian neocortex dis- 
tribute information to their targets. This knowledge is fundamental 
for deducing the logic of the communication between areas, for con- 
straining hypotheses about neural function and for the identification 
of putative sub-classes of neurons. Anatomical studies in macaques, 
cats and mice, which are mostly based on retrograde tracing meth- 
ods, indicate that there is an abundance of intracortical projection 
neurons in the sensory neocortex, which have axons that appear 
to innervate single target areas”, raising the possibility that infor- 
mation may be distributed through ensembles of dedicated path- 
ways that are functionally tailored to each target®'?. For example, 
neurons in the mouse primary visual cortex (V1) that innervate the 
posteromedial (PM) or anterolateral (AL) area appear to match the 
spatial and temporal frequency preference of these target areas”!*>"*. 
Similarly, neurons in the mouse primary somatosensory cortex 
that project to either the primary motor cortex or the secondary 
somatosensory area comprise largely non-overlapping populations 
with distinct physiological and functional properties®”'°. These 
findings indicate that dedicated lines—specialized subpopulations of 
neurons that preferentially target a single downstream area (Fig. 1a, 
left)—may represent a fundamental mode of cortico-cortical com- 
munication. Alternatively, intracortically projection neurons could 
broadcast to multiple targets*>*"', either randomly (Fig. 1a, middle) 
or by targeting specific sets of areas (Fig. la, right). These three models 
of cortical architecture have different implications for communica- 
tion between areas underlying sensory processing in hierarchical net- 
works. To distinguish between these models, we used two anterograde 


anatomical approaches, whole-brain fluorescence-based axonal trac- 
ing and high-throughput DNA sequencing of genetically barcoded 
neurons (MAPseq), to map the long-range axonal projection patterns 
of individual neurons in the mouse primary visual cortex, an area 
that distributes visual information to multiple cortical and subcortical 
targets*?-*, 


Fluorescence-based tracing of single neurons 

We first traced the projections of single neurons using whole-brain 
fluorescence-based axonal reconstructions. We used single-cell electro- 
poration of a GFP-encoding plasmid to label up to six layer-2/3 cells in 
the right visual cortex of each mouse. After allowing 3-10 days for GFP 
expression, we imaged the axonal projections of the labelled neurons 
by whole-brain serial two-photon tomography with 1 x 1 x 10-j1m? 
resolution’>4 (Fig. 1b). We then traced each fluorescently labelled cell 
(n=71; Fig. 1c, d) and registered each brain to the Allen Reference 
Atlas?° (Fig. le, f). To assess axonal labelling with GFP, we electro- 
porated neurons labelled retrogradely from the ipsilateral striatum, 
and in all cases observed axonal terminations therein (n = 9/9 cells; 
Extended Data Fig. 1), indicating a low false-negative rate of filling axon 
collaterals in distal targets of V1 neurons. Nonetheless, to minimize 
any possible contribution of incomplete axonal filling, we excluded 
those reconstructed V1 neurons with axon collaterals beyond V1 that 
terminated abruptly without branching (n = 28; Extended Data Fig. 2 
and Supplementary Note 1), although the results below are robust 
to the inclusion of these cells (Extended Data Fig. 2e). We did not 
exclude neurons with abrupt terminations of contralaterally projecting 
branches (see also ref. 6), instead restricting our analysis to ipsilaterally 
projecting axons. 

We analysed the ipsilateral projection patterns of 38 pyramidal neu- 
rons in layer 2/3, including 31 neurons in area V1 (Fig. 1g and Extended 
Data Figs 3, 4) and 7 neurons in nearby higher visual areas (Extended 
Data Fig. 5). Inspection of individual axonal arborizations of V1 
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Figure 1 | Brain-wide single-cell tracing reveals the diversity of 
axonal projection patterns of layer-2/3 V1 neurons, with most cells 
projecting to more than one target area. a, Three hypothetical modes 
of information transfer from one source area to multiple target areas. 
Neurons (arrows) could each project to a single area (left) or to several 
areas either randomly (middle) or in predefined projection patterns 
(right). b, Maximum projection of the coronal view of a representative 
GFP-filled neuron acquired by serial-section two-photon microscopy. 
Auto-fluorescence from the red channel is used to show the brain’s 
ultrastructure (grey background). Scale bar, 600 1m. n = 71 neurons. 

c, d, Higher magnification of the medial (c) and lateral (d) axonal 
arborization of the example neuron. Scale bars, 300m. e, Horizontal 
section through a sample brain (cyan) and the Allen Reference Atlas 
(purple) before (left) and after (right) rigid and non-rigid transformation 
of the brain to the atlas. f, Coronal, horizontal and sagittal projections 
of the traced example neuron overlaid in the Allen Reference Atlas 
space. Target cortical areas are coloured as indicated. A, anterior; AL, 
anterolateral; AM, anteromedial; ECT, ectorhinal; LI, lateroitermediate; 
LM, lateromedial; P, posterior; PER, perirhinal; PM, posteromedial; 
POR, postrhinal; RL, rostrolateral; TEA, temporal association. Scale bars, 
1mm. g, Overlay of all traced single neurons (top left) and 11 example 
cells in Allen Reference Atlas space; horizontal view (top) and sagittal 


neurons revealed a high degree of diversity in the projections regarding 
the number and identity of target areas (Fig. 1g and Extended Data 
Figs 3, 4), whereas this diversity was not clear in the bulk projection 
data*®”! (Fig. 1g, top left). 

Almost all layer-2/3 cells projected out of V1 (97%, n = 30/31; Fig. 1h) 
to one or more of 18 target areas in the telencephalon (Fig. 1i), typi- 
cally innervating nearby cortical areas but occasionally also projecting 
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view (bottom) of each cell. Dashed outlines label non-visual target areas. 
AC, anterior cingulate cortex; AMYG, amygdala; STR, striatum. Note that 
these images are for illustration purposes only, because a two-dimensional 
projection cannot faithfully capture the true three-dimensional axonal 
arborization pattern. Scale bar, 1 mm. h, The fraction of traced single 
neurons that project to at least one target area outside V1 is shown in blue. 
At least 1 mm of axonal innervation is required for an area to be considered 
a target. i, Projection pattern of all GFP-filled V1 neurons targeted randomly 
(n= 31 neurons). The colour code reflects the projection strengths of each 
neuron, determined as axon length per target area, normalized to the axon 
length in the target area receiving the densest innervation. Only brain areas 
that receive input from at least one neuron, as well as the striatum, are 
shown. AUD, auditory cortex; ENT, entorhinal; HIPP, hippocampus; LA, 
lateral amygdala; RHIPP, retrohippocampal region; RS, retrosplenial. j, The 
number of projection targets for every neuron that projects out of V1. k, The 
proportion of cells targeting more than one area, when projection targets 
that receive projections weaker than the indicated projection strength are 
ignored. For each neuron, projection strengths are normalized to axon 
length in the target area receiving the densest innervation. I, The fraction of 
neurons projecting to each of the 18 target areas of V1. m, The fraction of 
neurons innervating a single target area (‘dedicated’ projection neurons) out 
of all neurons that innervate that area. 


to the anterior cingulate cortex, striatum (Extended Data Fig. 1) and 
amygdala. To mitigate errors arising both from technical noise in atlas 
registration and from subject-to-subject variability in the bounda- 
ries between brain areas, we excluded low-confidence buffer zones of 
1001m around the area boundaries from analysis, and included only 
those areas that received over 1 mm of axonal input from an individ- 
ual cell as targets (see Methods). Eighty-five per cent of all projection 
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patterns appeared only once, highlighting the diversity of long-range 
projections. 

The majority of reconstructed layer-2/3 projection neurons sent 
axon collaterals to more than one target area (77%, n = 23/30), with 
some targeting up to seven areas (Fig. 1j). Although individual neurons 
innervated different target areas with different axonal densities, and 
thus might influence the computations in one area more than another, 
we found that a large fraction of ‘broadcasting’ cells innervated more 
than one target with comparable strengths (Fig. 1k). Moreover, the total 
length of axons scaled with the number of target areas (average length 
per brain area=4.6 +2.2 mm), such that the innervation density per 
target was, on average, similar irrespective of how many targets an 
axon innervated (Extended Data Fig. 6a, b). The innervation in higher 
visual areas was most dense in layers 2/3 and 5, consistent with recent 
reports’, often recapitulating the pattern of lateral axonal projections 
of layer-2/3 cells within V1 (Extended Data Fig. 6c-h). 

Posterior, postrhinal (POR), lateromedial (LM) and PM visual areas 
were the most common targets of V1 neurons (Fig. 11). Even when 
the analysis was restricted to neurons that projected to at least one of 
the six nearby cortical visual areas (laterointermediate (LI), LM, AL, 
PM, anteromedial (AM) or rostrolateral (RL)), we found that half of 
these neurons projected to two or more of these areas (Extended Data 
Fig. 7a—e). The fraction of input provided by dedicated projection 
neurons to any area comprised no more than 25% of the total input 
(Fig. 1m), and most target areas received no dedicated input. These 
conclusions were robust to changes in the size of the border exclu- 
sion zone between neighbouring areas and the minimum projection 
strength in the target area (Extended Data Fig. 7f-h). Similar to projec- 
tions from V1, all seven reconstructed neurons, which had cell bodies 
that resided in nearby higher visual areas, also projected to more than 
one target area (Extended Data Fig. 5). Our results thus show that most 
layer-2/3 neurons distribute information to multiple areas, rather than 
project to single targets. 

Interestingly, the location of the cell body within V1 was predictive of 
projection target for some recipient areas (Extended Data Fig. 8). Given 
the retinotopic organization of V1, this suggests that visual information 
from different parts of visual field may be preferentially distributed to 
specific target areas, which is consistent with recent findings”®. 


High-throughput MAPseq tracing 

We next investigated whether broadcasting cells choose their cortical 
target areas independently, or whether they target specific subsets of 
areas. Although the targeting of different combinations of areas distin- 
guishes individual V1 projection neurons (Fig. 1), their classification 
into putative sub-types requires a demonstration of higher-order pro- 
jectional structure within the population. We define the higher-order 
structure in terms of the connection patterns predicted by the per- 
neuron (first-order) probability of projecting to each target. For exam- 
ple, if the probability of any given neuron projecting to area A is 0.5 
and the probability of projecting to area B is also 0.5 then we would 
expect P(AMB) = P(A) x P(B) = 0.25 of all neurons to project to both 
Aand B if the decision to target these areas is independent. Significant 
deviations from this expectation would indicate the organization of 
the projections into non-random projection motifs. Investigating the 
higher-order structure requires large datasets, because, if a sample size 
of n neurons is required to estimate the first-order probabilities, then 
a sample size of n* is needed to estimate pairwise probabilities with 
comparable accuracy. Although single-neuron reconstruction provides 
very high spatial resolution, the tracing of axons remains highly labour 
intensive despite increases in throughput for data acquisition!””°. 

We therefore used a higher-throughput strategy, MAPseq”, to 
obtain the required number of single-neuron projections for higher- 
order statistical analysis. Ina MAPseq experiment, hundreds or thou- 
sands of neurons are labelled uniquely with random RNA sequences 
(barcodes) by a single injection of a library of barcoded Sindbis viruses 
(Supplementary Note 2). The barcodes are expressed and then actively 
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transported into the axonal processes of each labelled neuron, where 
they can be analysed by high-throughput barcode sequencing after 
dissection of potential target areas. The abundance of each barcode 
sequence in each area serves as a measure of the projection strength of 
the corresponding barcode-labelled neuron. MAPseq simultaneously 
maps the projections of all labelled neurons of dissected target areas, 
and therefore enables in-depth analysis of projections to a smaller set 
of targets. 

We used MAPseq to map the projection patterns of 553 neurons 
from V1 to six higher visual areas—LI, LM, AL, AM, PM and RL— 
that can be identified reliably by intrinsic signal imaging in vivo and 
dissected ex vivo for barcode sequencing (Fig. 2a, b, Extended Data 
Fig. 9 and Methods). To prevent the virus from spreading from V1 to 
adjacent areas, we made small focal injections of the MAPseq virus 
to yield 100-200 traced cells per mouse. Consistent with the analysis 
of fluorescence-based single-neuron reconstructions restricted to the 
six higher visual areas (Fig. 2c, left), almost half (44%) of all MAPseq 
neurons projected to more than one area (Fig. 2c, right). Furthermore, 
the projection patterns obtained by fluorescence-based tracing were 
statistically indistinguishable from those obtained by MAPseq (using 
a bootstrap procedure; see Supplementary Note 3), whereas randomly 
generated neurons with projection strengths sampled from a uniform 
distribution were markedly different (Fig. 2d). Therefore, the findings 
from the MAPseq dataset were consistent with those from the sin- 
gle-neuron tracing dataset. 

We first catalogued the diversity of single-neuron projection patterns 
from V1 to six higher visual areas by unsupervised clustering of the 
MAPseq dataset (k-means clustering with a cosine distance metric). 
These projectional data were best described by eight clusters (Fig. 2e, 
Extended Data Fig. 10), of which all but one contained cells targeting 
more than one area. The most common combination of broadcasting 
neurons involved areas LM and PM, consistent with the fact that a large 
fraction of neurons targeted these areas and the suggestion of LM” and 
PM as integrative hubs of V1 signals, similar to the secondary visual 
cortex in the monkey (Fig. 2f). 

To investigate whether non-random projection motifs existed in the 
MAPseq dataset, we measured the likelihood of specific bi-, tri- or 
quadrifurcations and compared them to their expected probabilities 
(assuming independence between each projection type; Fig. 3a, b). 
This analysis identified six projection motifs that were significantly 
over- or underrepresented after a correction for multiple compari- 
sons (Bonferroni adjustment; Fig. 3b, c). Together, these six projection 
motifs represented 73% of all broadcasting cells that were identified by 
MAPsegq. Therefore the majority of V1 cells projecting to multiple tar- 
get areas do so ina non-random manner, suggesting that broadcasting 
motifs reflect several sub-classes of projection neurons for divergent 
information transfer from V1 to higher visual areas. 

The most underrepresented broadcasting motif was the bifurcation 
between PM and AL (Fig. 3d). These two areas exhibit distinct visual 
response properties'*!* and receive functionally specialized input from 
V1’, consistent with the idea of exclusive projections from V1 into these 
areas. Moreover, the underrepresented population of neurons, which 
project to both PM and AL, was further split into two groups according 
to projection strength; one population primarily innervates PM and 
another primarily innervates AL (Fig. 3d). A second underrepresented 
motif is the bifurcation between PM and LM (Fig. 3e). However, in 
contrast to the PM-AL bifurcation, the detected PM-LM-projection 
neurons do not clearly separate into two classes. Our findings therefore 
provide an anatomical substrate for the previously reported functional 
dichotomy of AL and PM areas, and suggest that a few ‘dedicated’ out- 
put channels can co-exist with the prevalence of broadcasting cells that 
co-innervate multiple targets. 

In addition to the two underrepresented projectional motifs, we 
identified four overrepresented motifs, that is, combinations of tar- 
get areas that receive more shared input from individual V1 neurons 
than expected from first-order projection statistics (Fig. 3f-h). Cells 
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Figure 2 | MAPseq projection mapping reveals a diversity of projection 
motifs. a, Overview of the MAPseq procedure. Six target areas were 
chosen for analysis: LI, LM, AL, PM, AM and RL. b, Projection strength 
in the six target areas, as well as the olfactory bulb (OB) as a negative 
control, of 553 neurons mapped using MAPseq. Projection strengths per 
neuron are defined as the number of barcode copies per area, normalized 
to the efficiency of sequencing library generation and to the neuron’s 
maximum projection strength (n= 4 mice). c, Number of projection 
targets of V1 neurons when considering the six target areas only, based on 
the fluorescence-based axonal reconstructions (left) or the MAPseq data 
(right). d, Distribution of cosine distances obtained by a bootstrapping 
procedure (1,000 repeats) between MAPseq neurons (blue), fluorescence- 
based single-neuron reconstructions and MAPseq neurons (orange), 

or random neurons (with projection strengths sampled from a uniform 


that innervated both PM and AM were significantly more abundant 
than expected by chance (Fig. 3f). Resolving the projection strengths 
within this motif revealed two subpopulations of neurons, one that 
innervates PM more than AM, the other innervates both areas with 
similar strength. Moreover, neurons bifurcating to LM and AL were 
also highly overrepresented (Fig. 3g) and comprised the most abun- 
dant class of broadcasting cells (Fig. 3b). The most significantly over- 
represented trifurcation motif was the projection to PM, LM and LI, 
comprising a relatively homogenous population that projects to LM 
and PM with similar strengths while projecting slightly less strongly to 
LI (Fig. 3h). Finally, we discovered that trifurcation between PM, AM 
and RL was overrepresented, but it appeared only rarely in our dataset 
(Fig. 3b). These motifs did not arise from false negatives (undetected 
connections) or false positives (Supplementary Note 4 and Extended 
Data Fig. 2f). 

These projectional data have implications for the categorization of 
higher visual areas into putative streams of visual processing in mouse 
neocortex. Areas AL and PM on the one hand, and LM and LI on the 
other, have been suggested to belong to dorsal and ventral processing 
streams in the mouse visual system, respectively*!~*°. Given that these 
areas receive a high degree of shared input (for example, LM-PM 
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distribution) and MAPseq neurons (yellow). The distance distributions 
obtained from MAPseq neurons and fluorescence-based single-neuron 
reconstructions are statistically indistinguishable (Kolmogorov-Smirnov 
one-sided two-sample test; P= 0.94; a = 0.05), whereas the distributions 
obtained from both MAPseq neurons or fluorescence-based reconstructed 
neurons are statistically different from the distribution obtained using 
random neurons (Kolmogorov—Smirnov two-sample test; P< 1073; 
a=0.05). e, Centroids and example cells for eight clusters obtained by 
k-means clustering of all MAPseq cells using a cosine distance metric. 
Target areas are coloured to indicate the projection strength of the plotted 
neuron. Projection strengths are normalized as in b. f, The probability of 
projecting to one area (area A) given that the same neuron is projecting to 
another area (area B) based on the MAPseq dataset. 


bifurcation, which was still abundant even though it was underrep- 
resented; AL-LM bifurcation; PM-LM-LI trifurcation), such a dis- 
tinction is unlikely to originate as a result of segregated V1 input into 
these areas. 


Discussion 

In summary, our results reveal some of the principles by which single 
neurons in one cortical area distribute information to downstream 
target areas. Almost all layer-2/3 pyramidal cells projected outside of V1, 
indicating that V1 neurons concurrently participate in local and distal 
computations. We found that the single-neuron projections outside V1 
were highly diverse, innervating up to seven targets, predominantly 
in specific, non-random combinations (Extended Data Fig. 10g, f). 
These results suggest a functional specialization of subpopulations of 
projection cells beyond ‘one neuron-one target area mapping. 

The fraction of neurons in V1 that broadcast information to multiple 
targets is considerably greater than has previously been indicated 
using retrograde tracing methods”*!*. This difference is unlikely to 
be caused by differences in the sensitivity with which these approaches 
detect the projection patterns of individual cells. Instead, anterograde 
tracing maps projections to many or all targets simultaneously, whereas 
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Figure 3 | Over- and underrepresented projection motifs of neurons 

in V1. a, The null hypothesis of independent projections to two target 
areas (top) and an example deviation (overrepresented bifurcation) from 
the null hypothesis (bottom). b, The observed and expected abundance 

of all possible bi-, tri- and quadrifurcation motifs in the MAPseq dataset. 
Significantly over- or underrepresented motifs, based on a binomial test 
with Bonferroni correction (see Methods), are indicated by black and grey 
arrowheads, respectively. n = 553 neurons from four mice. c, Statistical 


retrograde tracing typically analyses only two or three potential target 
sites at a time. Because the fraction of neurons projecting to any pair of 
targets selected for retrograde tracing is relatively low (typically <10%), 
most neurons will not be doubly labelled in any given experiment; only 
by sampling many potential targets in a single experiment can the true 
prevalence of broadcasting be uncovered. Indeed, if we simulate dou- 
ble retrograde tracing based on our MAPseq results, the fractions of 
bifurcating neurons are comparable to those observed when using 
retrograde methods in primates”*'®!* (Supplementary Table 1). 

We speculate that dedicated projection neurons—which comprise 
the minority of neurons in V1—convey specialized visual information 
that is tailored to their target area, as has previously been suggested®"". 
Indeed, the most underrepresented projection motif from V1, the 
PM-AL bifurcation, innervates two target areas with distinct preferences 
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significance of over- and underrepresented broadcasting motifs and 
associated effect sizes, based on a binomial test with Bonferroni correction 
(see Methods). »=553 neurons from four mice. d-h, The projection 
strengths of the individual neurons (one per line) giving rise to the six 
underrepresented (d, e) and overrepresented (f-h) projection motifs. For 
each neuron, the projection strength in each target area is normalized 

to the neuron’s maximum projection strength. Lines of the same colour 
represent neurons mapped in the same brain (n =4 mice). 


for visual features'**, By contrast, we suggest that the majority of cells 
encode information that is shared and in a form that is suitable for 
generating visual representations or multimodal associations across 
subsets of areas. Indeed, those target areas that are preferentially co-in- 
nervated by broadcasting neurons appear to have more similar visual 
response properties!*'*, Broadcasting cells may also coordinate activ- 
ity among the subset of areas that they co-innervate, thus providing a 
signal that links different processing streams. The divergent nature of 
signal transmission from a primary sensory cortex to its targets may 
therefore help to constrain models of hierarchical sensory processing. 
The existence of distinct projection motifs that either avoid or favour 
subsets of target areas suggests that sub-types of intracortical projection 
neurons exist and raises the question of how these specific, long-range 
connectivity patterns are established during development. 
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METHODS 


Mice. The anatomical single-cell tracing experiments were conducted at The 
Biozentrum, University of Basel, Switzerland. We obtained licenses and performed 
all experimental procedures in accordance with Basel Canton animal welfare guide- 
lines using both male and female adult (>8 weeks of age) C57BL/6 mice. Detailed 
protocols and all software are available at http://mouse.vision/han2017. 
Fluorescence-based single-neuron tracing. Two-photon guided single-cell electro- 
poration. We performed surgery as previously described™. In brief, we anaesthe- 
tized mice with a mixture of fentanyl (0.05 mg kg~'), midazolam (5 mg kg~') and 
medetomidine (0.5 mg kg” !), and maintained stable anaesthesia by isoflurane (0.5% 
in O2). We performed all electroporation experiments on a custom linear scanning 
two-photon microscope, equipped to image both a green and a red channel and 
running ScanImage 5.1». For electroporation, we used a patch pipette (12-16 MQ) 
filled with plasmid DNA (pCAG-eGFP (Addgene, 11150) or pAAV-EF1la-eGFP- 
WPRE (gift from B. Roska; sequence file can be found in the Supplementary 
Information, 100 ngjl~!) and AlexaFluor 488 (504M) in intracellular 
solution, and delivered electroporation pulses (100 Hz, —14 V, 0.5-ms duration 
for 1s) with an Axoporator 800A (Molecular Probes) when pushed against a 
target cell. We verified successful electroporation by dye filling of the cell body, 
and then sealed the skull with a chronic window using 1.5% agarose in HEPES- 
buffered artificial cerebrospinal fluid and a cover slip. We confirmed plasmid 
expression two days after electroporation by visualization of GFP epifluorescence 
through the chronic imaging window. Three to ten days after electroporation, we 
transcardially perfused anaesthetized mice with 10 ml 0.9% NaCl followed by 50 ml 
4% paraformaldehyde in 0.1 M phosphate buffer (pH 7.4). We removed the brains 
from the skull and post-fixed them in 4% paraformaldehyde overnight at 4°C. 
We then stored the fixed brains in PBS at 4°C until imaging with serial-section 
two-photon tomography. 

Serial-section two-photon tomography. We embedded the fixed brains in 5% 
oxidized agarose (derived from type-I agarose (Sigma-Aldrich)) and covalently 
cross-linked the brain to the agarose by incubation in an excess of 0.5-1% sodium 
borohydride (NaBHy, Sigma-Aldrich) in 0.05 M sodium borate buffer overnight 
at 4°C. We then imaged embedded brains using a Tissue Vision two-photon scan- 
ning microscope”*°, which cut physical sections of the entire brain every 501m 
coronally, and acquired optical sections every 101m in two channels (green chan- 
nel: 500-560 nm; red channel: 560-650 nm) using 940-nm excitation laser light 
(Mai Tai eHP, Spectraphysics). Each imaged section is formed from overlapping 
800 x 800-|1m ‘tiles. We imaged with a resolution of 11m in x and y and measured 
an axial point spread function of ~51m FWHM (full width at half maximum) 
using ScanImage 5.1. 

Image processing and cell tracing. We stitched raw image tiles using custom 
MATLAB-based software, StitchIt. StitchIt applies illumination correction based 
on the average tiles for each channel and optical plane, and subsequently stitches 
the illumination-corrected tiles from the entire brain. We then navigated through 
the stitched brain space using MaSIV (https://github.com/alexanderbrown/masiv), 
a MATLAB-based viewer for very large 3D images, and traced axons using a 
custom, manual neurite-tracing extension for MaSIV. The tracer was not blinded, 
as no comparison across experimental conditions was performed. No power 
calculations were performed. 

To assign each voxel of the imaged brains to a brain area, we segmented each 
brain using areas defined by the Allen Reference Atlas (Common Coordinate 
Framework v.3, © 2015 Allen Institute for Brain Science, Allen Brain Atlas API, 
available from http://brain-map.org/api/index.html), after smoothing with a single 
pass of a Gaussian kernel with an s.d. of 0.5 using the Nifty ‘seg-maths’ tool as 
described previously””. In brief, we downsampled one imaging channel to a voxel 
size of 251m and converted it to MHD format using StitchIt. We then registered the 
volume to the average template brain of the Allen Reference Atlas using Elastix*® 
by applying rigid affine transformation followed by non-rigid deformation with 
parameters as previously described*”"°. We examined registration quality using 
a custom Python/PyQt5 application, Lasagna, which overlays the Allen template 
brain and the registered sample brain and is extendable to allow the overlay of 
traced cells, or the overlay of area borders from the Allen Reference Atlas onto 
a downsampled brain. To transform the traced cells into Allen Reference Atlas 
space (sample to the Allen Reference Atlas), we calculated the inverse transform 
to the one calculated by Elastix (Allen Reference Atlas to sample) and applied this 
to the traced points. 

Analysis of traced neurons. To prevent potential incomplete filling of neurons from 
biasing the results of our analyses, we excluded cells with non-arborizing primary 
branches in the ipsilateral hemisphere from the analysis. Out of a total of 71 traced 
cells, we excluded 28 cells that exhibited abrupt, non-callosal terminations, as well 
as 5 cells that were back-labelled from the striatum, thus restricting our analysis to 
ipsilateral projection patterns of 31 cells in V1 and 7 in other higher visual areas. 
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Moreover, axonal branches terminating contralaterally or after entering the corpus 
callosum were considered as callosal terminations and were included in the analy- 
sis (see also ref. 6). We calculated the first-order projection statistics only using the 
cells registered in the Allen Reference Atlas that satisfied these criteria. To reduce 
any artefacts associated with registration in the Allen Reference Atlas or individual 
brain variability in boundaries between brain areas, we excluded any axon within 
501m from any brain area boundary from the analysis. We then calculated the 
projection strength of each neuron to each area as the total length of axon of 
that neuron in an area. To determine the number of projection targets for 
every cell, we used a minimum projection strength of 1-mm axon length per 
target area. 

MAPseq. MAPseq sample processing. To define the V1 injection site and target 
higher visual areas (LI, LM, AL, PM, AM and RL), we used optical imaging of 
intrinsic signals as previously described'**", In brief, we first implanted a custom- 
ized head plate and then thinned the skull to increase its transparency. After 2-3 
days of recovery, we sedated the mice (chlorprothixene, 0.7 mg kg!) and lightly 
anaesthetized them with isoflurane (0.5-1.5% in O3), delivered via a nose cone. 
We illuminated the visual cortex with 700-nm light that was split from an LED 
source into two light guides, performing imaging with a tandem lens macroscope 
focused 250-500 1m below the cortical surface and a bandpass filter centred at 
700 nm with 10-nm bandwidth (67905; Edmund Optics). We acquired images 
at 6.25 Hz with a 12-bit CCD camera (1300QF; VDS Vosskithler), frame grab- 
ber (PCI-1422; National Instruments) and custom software written in LabVIEW 
(National Instruments). We visually stimulated the contralateral eye of mice with 
a monitor placed at a distance of 21 cm and presented 25-35° patches of 100% 
contrast square wave gratings with a temporal frequency of 4 Hz and a spatial 
frequency of 0.02 cycles per degree for 2s followed by 5s of grey screen (mean 
luminance of 46 candela per m’). To establish a coarse retinotopy of the targeted 
area, we alternated the position of the patches: we used two different elevations 
(approximately 0 and 20°) and two different azimuths (approximately 60 and 90°); 
at each position, we acquired at least 17 trials. We obtained intrinsic signal maps 
by averaging the responses during the stimulation time using Image] (National 
Institute of Mental Health, NIH) and mapping the location of the estimated spots 
of activation onto a previously acquired blood vessel picture. 

We then pressure-injected (Picospritzer III, Parker) 100 nl of 1 x 10'° genome 
copies ml! barcoded MAPseq Sindbis virus®’ with a diversity of >8 x 10° different 
barcode sequences unilaterally at a depth of 100-200 |1m from the brain surface 
into V1 of four 8-10-week-old C57BL/6 female mice. In addition, we labelled the 
six higher visual areas by placing a Dil-coated micropipette into retinotopically 
matched positions according to intrinsic signal maps. For this, we allowed 2-5 11 
of 2.5mg ml! Dil (Invitrogen D3911) in ethanol solution to dry on the outside of 
a pulled micropipette tip until some Dil crystals were visible. Mice were euthanized 
44-48 h after injection by decapitation, and their brain immediately extracted and 
flash-frozen on dry ice. 

We cut 180-j1m thick coronal sections using a cryostat at —10°C blade and 
sample holder temperature, and melted each slice onto a clean microscope slide 
before rapidly freezing it on dry ice again. We then dissected each target area and 
the injection site using cold scalpels while keeping the brain sections frozen on a 
metal block cooled to approximately —20°C in a freezing 2.25 M CaCl, bath”. 
During dissection, we identified each dissected area using a fluorescent dissection 
microscope to visualize viral GFP expression and Dil stabs labelling each target 
area (Extended Data Fig. 7). Throughout the procedure, we took care to avoid 
sample cross-contamination by never reusing tools or blades applied to differ- 
ent areas and changing gloves between samples. To measure noise introduced by 
contamination, we collected samples of the olfactory bulb from each brain, which 
served as a negative control. 

We then processed the dissected samples for sequencing mostly as previ- 
ously described*”, but pooling all samples after first-strand cDNA synthesis. In 
brief, we extracted total RNA from each sample using TRIzol reagent (Thermo 
Fisher) according to the manufacturer’s instructions. We mixed the sample RNA 
with spike-in RNA (obtained by in vitro transcription of a double-stranded 
ultramer with sequence 5’-GITCATGATCATAATACGACTCACTATAG 
GGGACGAGCTGTACAAGTAAACGCGTAATGATACGGCGACCACCGAGA 
TCTACACTCTTTCCCTACACGACGCTCTTCCGATCTNNNNNNNNNNNN 
NNNNNNNNNNNNATCAGTCATCGGAGCGGCCGCTACCTAATTGCCG 
TCGTGAGGTACGACCACCGCTAGCTGTACA-3/ (IDT)*”) and reverse 
transcribed the RNA mixture using a gene specific primer 5/-CTTGGCACCCGA 
GAATTCCANNNNNNNNNNNNXXXXXXXXTGTACAGCTAGCGGTGGT 
CG-3’, where Xg is one of >300 true-seq-like sample-specific identifiers and 
Ny is the unique molecular identifier, and SuperscriptIV Reverse Transcriptase 
(Thermo Fisher) according to the manufacturer’s instructions. We then 
pooled all first-strand cDNAs, purified them using SPRI beads (Beckman 
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Coulter) and produced double-stranded cDNA as previously described**. 
We then treated the samples using Exonucleasel (NEB) and performed two 
rounds of nested PCR using primers 5’-CTCGGCATGGACGAGCTGTA-3/ 
and 5’‘-CAAGCAGAAGACGGCATACGAGATCGTGATGTGACTGGAG 
TTCCTTGGCACCCGAGAATTCCA-3’ for the first PCR and primers 
5'-AATGATACGGCGACCACCGA-3’ and 5’-CAAGCAGAAGACGGCA 
TACGA-3’ for the second PCR using Accuprime Pfx polymerase (Thermo Fisher). 
Finally, we gel-extracted the resulting PCR amplicons using Qiagen MinElute 
Gel extraction kit according to the manufacturer's instructions and sequenced 
the library on a Illumina NextSeq500 high-output run at paired-end 36 using the 
SBS3T sequencing primer for paired-end 1 and the Illumina small RNA sequencing 
primer 2 for paired-end 2. 

MAPsegq data analysis. On the basis of the sequencing results, we constructed a 
barcode matrix M of size of (number of barcodes) x (number of dissected areas) 
with entry Mj, representing the absolute counts of barcode i in area j as previously 
described*°. We de-multiplexed the sequencing results, extracted the absolute 
counts of each barcode in each sample based on the UMI sequence and error- 
corrected the barcode sequences, before matching barcode sequences to the virus 
library and constructing matrix M by matching barcode sequences across areas. 
We then filtered the barcode matrix for ‘high-confidence’ cell bodies inside the 
dissected area of V1 by requiring a minimum of 10 counts in at least one target area, 
an at least tenfold difference between the cell body location in V1 and the most 
abundant target area in data normalized to the efficiency of library production 
as measured by the amount of recovered spike-in RNA counts, and an absolute 
minimum barcode count of 300 in V1. We then normalized the raw barcode counts 
in each area to the relative spike-in RNA recovery to the olfactory bulb sample, 
merged the results from all four processed brains into a single barcode matrix and 
used this matrix for all further analysis. 

To determine whether a particular neuron projected to any given target area, we 
chose a conservative threshold of at least 5 barcode counts, based on the highest 
level of barcode expression in the olfactory bulb negative control sample. 
Calculation of statistical significance of projection motifs. To calculate the statistical 
significance of broadcasting projection motifs, we compared against the simplest 
model in which we assumed that each neuron projected to each area independently. 
To generate predictions of this model, we first estimated the probability of pro- 
jecting to each area, assuming independent projections. We define the probability 
P(Aj) that a given neuron projects to the ith area A; as 


Nai 


P(Aj) = 
Nootal 


in which N4; is the number of neurons in the sample that project to area Aj, 
i=1...k for k analysed target areas, and Njota is the total number of neurons in the 
sample. 

In our MAPseq experiments, we do not have direct access to Niotal since for 
technical reasons we only include neurons that have at least one projection among 
the dissected targets. Because, in principle, some neurons might project to none of 
the areas dissected (see Fig. 3a), failure to include these would lead to an underes- 
timation of Nota. However, assuming independence of projections, we can infer 
Nwotai from the available measurements. 

To estimate Niotal, We first observe that 


P(project to at least one area) + P(project to no area) = 1 
k 

Nobs ae Naj Jn. 

Mootal MNeotal 


jet 


where Nop; is the total number of neurons observed to project to at least one area. 
For k= 6 areas, we can expand this expression to 
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Noting that this is a quintic equation in Nita, We can use a root finder to solve for 
MNtotal HUMerically, and use the result to calculate P(A)). 

Using the derived Njotal and P(A;), we can calculate the P value for every possible 
broadcasting motif by calculating the value of the binomial cumulative distribution 
function, for a total of Njota tries, the empirical number of observed counts (suc- 
cesses), and P(motif) assuming independent projections. We calculated the P value 
of all possible bi-, tri- and quadrifurcations, and determined significantly over- or 
underrepresented broadcasting motifs at a significance threshold of «= 0.05 after 
Bonferroni correction. 

Code availability. All software is available at http://mouse.vision/han2017. 
Data availability. All sequencing data are publicly available in the Sequence 
Read Archive under accession numbers SRR5274845 (ZL097 for mouse 4 
and mouse 5) and SRR5274844 (ZL102 for mouse 6 and mouse 7). All single-cell 
tracing results are accessible at http://mouse.vision/han2017 and on neuromorpho 
at http://neuromorpho.org/NeuroMorpho_ArchiveLinkout.jsp? ARCHIVE= 
Han_etal. 
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Extended Data Figure 1 | Single-neuron tracing protocol efficiently 
fills axons projecting to the ipsilateral striatum. We retrogradely 
labelled striatum-projecting cells by stereotactically injecting cholera 
toxin subunit B conjugated to AlexaFluor 594 or PRV-Cre into the visual 
striatum of wild-type mice or tdTomato-reporter mice (Ail4, Jax), 
respectively (magenta). With visual guidance of two-photon microscopy, 
we electroporated single retrogradely labelled cells in V1 with a GFP- 
expressing plasmid (cyan). a, Coronal, maximum intensity projections 
of visual striatum. Scale bar, 1 mm. b, Higher magnification image of the 
visual striatum. Scale bar, 0.2 mm. c, Single-channel images of the same 


ARTICLE 


YH297_02 


axonal arborization as in b. d, Coronal maximum intensity projection 
containing V1. Scale bar, 1 mm. e, Higher magnification image of V1. 
Scale bar, 0.2 mm. f, Single-channel images of V1. Scale bars, 0.2 mm. 

g, Horizontal projections in the Allen Reference Atlas space of eight 
retrogradely labelled and electroporated cells. Cell ID numbers are 
indicated at the top right of each image. Scale bars, 1 mm. Note that 

one additional cell was retrogradely labelled and electroporated, which 
revealed its axonal projection to the striatum, but it is not shown because 
the brain was too distorted to allow accurate registration to the Allen 
Reference Atlas. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


Abrupt terminal 


a. 
oO 


Cells without Cells with All cells 


abrupt terminations abrupt terminations T targets 
6 targets 2% (1 cell) 
5% (1 cell) \ 


3 


6 targets 

2% (1 cell) 

il 5 targets 
10% (5 cells) 


7 targets 

3% (1 cell) 5 targets 

10% (3 cells) 1 target 
26% (5 cells) 


o 


5 targets 
11% (2 cells) 1 target 


1 target 
24% (12 cells) 


23% (7 cells; 


o 


4 targets 


7% (2 cells) 4 targets 


110% (5 cells) 


x 


| 4 targets 


16% (3 cells) 
3 targets 2 targets 


2 targets 
3 targets 16% (cells) 26% (cells, 22% (11 cells) 
30% (2 cells) —— 


i) 


Distance of premature 
termination to soma (mm) 


3 targets 
2 targets py 2% (14 cells) 
% 3 4 6 8 to 27% (8 cells) 
Distance of farthest non-premature 


termination to soma (mm) 


= 


0.5, 0.5 05 0.5 
PM-AM AL-PM LM-PM LM-AL 
0.4 04 04 04 
oO oO oO oO 
303 303 303 303 
o oC | o | fe o 
> i > | > & > 
a 2.2 }\ a 0.2 a 22)\ ao 02 
rh y \ 
0.1), 0.4) \ O11 \ 0.1; a 
a a Que, Qe o> ae 
04 06 08 10 12 14 16 18 20 04 06 08 10 12 14 16 18 2.0 04 06 08 10 12 14 16 18 2.0 04 06 08 10 12 14 16 18 20 
Dropfraction Dropfraction Dropfraction Dropfraction 
0.5 0.5 
PM-AM-RL LI-LM-PM 
0.4 
o | 
303 
oO 
> 
a 02 
04 é 
ok Se, =A 0 _ 
04 06 08 10 12 14 16 18 2.0 04 06 08 10 12 14 16 1.8 2.0 


Dropfraction Dropfraction 


Extended Data Figure 2 | See next page for caption. 
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Extended Data Figure 2 | Some axonal branches terminate abruptly 
without arborizing, whereas other branches of the same neuron 
arborize extensively within different target areas and appear to be 
completely filled. a, Horizontal view of a representative cell in the Allen 
Reference Atlas space. The abrupt termination is labelled with a purple 
square. n = 28 abruptly terminating cells. b, The abrupt termination of the 
example cell shown as a maximum Z projection (left) and in the individual 
z sections (right). c, Two normal terminations of the same cell, shown as a 
maximum z projection (left) and in two colour-coded series of z sections 
(right). d, Distance of abrupt termination from cell body versus distance 
of furthest regular termination of the same cell. Dashed line indicates the 
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unity line. e, The distribution of target numbers of all projection neurons 
without abrupt terminations (as shown in the main figures; left), of 
projection cells with abrupt terminations (middle) and of all projection 
neurons (no abrupt terminations and abrupt terminations; right). f, To test 
the effect of false negatives on our analyses, we simulated the random loss 
or gain of projections from the MAPseq dataset, while maintaining overall 
area projection probabilities. n = 553 neurons; 400 repeats. P values based 
on a binomial test for all six projection motifs determined as significantly 
over- or underrepresented in our dataset are plotted after removing 
(dropfraction <1) or adding (dropfraction >1) connections. Mean (black 
line) and s.d. (shaded area) are indicated. 
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Extended Data Figure 3 | Images of traced layer-2/3 V1 neurons. Horizontal views of the Allen Reference Atlas space are shown, and cell ID numbers 
are indicated at the top right of each image. Scale bars, 1 mm. 
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Extended Data Figure 4 | Images of traced layer-2/3 V1 neurons, continued. Horizontal views of the Allen Reference Atlas space are shown, and cell 


ID numbers are indicated at the top right of each image. Scale bars, 1 mm 
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Extended Data Figure 5 | Individual neurons in higher visual areas project to more than one target area. a, All traced neurons with cell bodies not in 
V1. Brain area identity is colour coded as in Fig. 1. Cell identity is indicated at the top right of each image. Scale bars, 1 mm. b, Histogram of the number 
of target areas per cell. 
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Extended Data Figure 6 | Density of axonal innervation by area and 
layer of V1 layer-2/3 projection neurons. a, Total axon length plotted 

as a function of the number of targets innervated by every V1 projection 
neuron. b, Axon length in area LM, PM or POR plotted as a function of 
the total number of targets innervated by each neuron projection to the 
respective area. c—h, The axons of V1 neurons in target areas most densely 
innervate layers 2/3 and 5, with some density in layer 1, but less in layers 4 
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and 6, often recapitulating the laminar axonal profile within V1. Coronal 
views of each area are shown in Allen Reference Atlas space (left) and 
axonal arborizations of each neuron innervating the area are colour coded. 
Scale bars, 200m. A histogram of the laminar innervation is shown 
(right). Note that cells with abrupt terminations outside the shown area 
were included in this analysis. Areas depicted are V1 (c), AL (d), LI (e), 
LM (f), PM (g) and POR (h). White-matter axons are not shown. 
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Extended Data Figure 7 | Conclusions from fluorescence-based single- 
neuron tracing data hold true when the analysis is restricted to subset 
of target areas. a, The projection patterns of reconstructed GFP-filled 
neurons when only the six target areas (LI, LM, AL, PM, AM and RL) 

are considered. Projection strengths are normalized to the maximum 
projection of each neuron, and only neurons projecting to at least one 
target area are shown. b, The distribution of target area numbers per 


projection neuron. ¢, The fraction of all cells projecting to each target area. 


d, The fraction of dedicated input per area. e, The number of times each 
binarized projection motif is observed. f, The fraction of broadcasting 
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cells as a function of the minimum projection strength (relative to 

the primary target) that each area needs to receive to be considered a 
target. g, The fraction of broadcasting cells as a function of increasing 
buffer zones between areas within which axons are ignored, assuming a 
minimum projection of 1 mm of axon per target area. h, The fraction of 
broadcasting cells as a function of the minimal amount of axon per area 
for it to be considered a target, assuming buffer zones of 100-j1m width. 
Black arrowheads indicate chosen buffer zones and minimal projection for 
analysis in the paper. 
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Extended Data Figure 8 | Location of cell bodies in V1 as a function of 
their projection targets. a-1, Horizontal views of Allen Reference Atlas 
space are shown. The location of all traced V1 neurons are indicated as 
circles (cells with no abrupt terminations) or squares (cells with abrupt 
terminations). In each plot, the cells projecting to the highlighted higher 
visual area are coloured in solid blue. Target areas considered are A (a), 
AL (b), AM (c), ECT (d), LI (e), LM (f), P (g), PER (h), PM (i), POR 

(j), RL (k) and TEA (1). m, n, Quantification of cell body location in the 
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rostrocaudal (m) and mediolateral (n) direction. Dotted lines indicate 
expected number of cells based on a bootstrapping procedure, for which 
we randomly selected neurons from the available positions to project to 
each area and repeated the process 10,000 times. P values were derived 
from the bootstrapping probability distribution and are indicated for 
projection targets significantly deviating from this expectation (a = 0.05). 
P values below 10~‘ are not exact and are therefore indicated as a range. 
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Extended Data Figure 9 | MAPseq dissection strategy. We identified 
the to-be-dissected higher visual areas by performing intrinsic imaging 
of visual cortex in response to stimuli at different positions in the 
contralateral visual field and mapping the resulting changes in intrinsic 
signals. a, A representative retinotopic map, with responses to the two 
25° visual stimuli pseudocoloured in green and magenta (stimulus 1 
position: 90° azimuth, 20° elevation; stimulus 2 position: 60° azimuth, 
20° elevation). On the basis of this map, we fluorescently labelled 
retinotopically matched positions in the to-be-dissected cortical areas 


Slice 3 Slice 4 Slice 5 


A 


Slice 3 Slice 4 


Slice 8 Slice 9 Slice 10 


with a Dil stab (white circles). Putative borders between the higher visual 
areas are indicated with dashed lines for orientation. Scale bar, 1 mm. n=4 
mice. b, The MAPseq virus injection site is discernible in consecutive 
frozen 180-|1m thick coronal sections, using GFP fluorescence. Scale bars, 
1mm. ¢, Dil injections targeted to matched retinotopic positions in six 
target areas identified by intrinsic signal imaging. Dil epifluorescence 
images of each 180-1m thick slice are shown, and dissected areas are 
labelled. Scale bars, 1 mm. 
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Extended Data Figure 10 | Clustering of MAPseq data and data 
summary. a, b, Gap (a) and silhouette criteria (b) for k-means clustering 
of the MAPseq neurons as a function of the number of clusters. Black 
arrow heads indicate chosen number of clusters (k = 8). c, d, Centroids for 
alternative, near-optimal cluster number choices with k =3 (c) andk=5 
(d). e, Hierarchical clustering results of the MAPseq dataset using a cosine 
distance metric. c-e, Colour intensity indicates projection strengths. 

f, g, Summary of single-neuron projections from V1. f, Cells targeting 
single higher visual areas (dedicated projection neurons) comprise the 


Projection strength 


Broadcasting projection neurons 
77% brain-wide; 
50% to six higher visual areas 


Under-represented Consistent with chance* 


Wm 50 neurons 
* Motifs present >4 neurons 


minority of layer-2/3 V1 projection neurons. Among the areas analysed by 
MAPseq, dedicated projection neurons predominantly innervate cortical 
areas LM or PM. g, Cells projecting to two or more areas (broadcasting 
projection neurons) are the dominant mode of information transfer 

from V1 to higher visual areas. In the six areas analysed by MAPseq, 
broadcasting neurons innervate combinations of target areas in a non- 
random manner, including those that are more or less abundant than 
expected by chance. Line width indicates the absolute abundance of each 
projection type as observed in the MAPseq dataset. 
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Evolved Cas9 variants with broad PAM 
compatibility and high DNA specificity 


Johnny H. Hu!, Shannon M. Miller!??, Maarten H. Geurts!*, Weixin Tang), Liwei Chen! *, Ning Sun!? 
Christina M. Zeinal23 , Xue Gaol? , Holly A. Rees!*3, Zhi Lin! 3 & David R. Livh23 


A key limitation of the use of the CRISPR-Cas9 system for genome editing and other applications is the requirement that 
a protospacer adjacent motif (PAM) be present at the target site. For the most commonly used Cas9 from Streptococcus 
pyogenes (SpCas9), the required PAM sequence is NGG. No natural or engineered Cas9 variants that have been shown to 
function efficiently in mammalian cells offer a PAM less restrictive than NGG. Here we use phage-assisted continuous 
evolution to evolve an expanded PAM SpCas9 variant (xCas9) that can recognize a broad range of PAM sequences 
including NG, GAA and GAT. The PAM compatibility of xCas9 is the broadest reported, to our knowledge, among Cas9 
proteins that are active in mammalian cells, and supports applications in human cells including targeted transcriptional 
activation, nuclease- mediated gene disruption, and cytidine and adenine base editing. Notably, despite its broadened PAM 
compatibility, xCas9 has much greater DNA specificity than SpCas9, with substantially lower genome-wide off-target 
activity at all NGG target sites tested, as well as minimal off-target activity when targeting genomic sites with non- NGG 
PAMs. These findings expand the DNA targeting scope of CRISPR systems and establish that there is no necessary trade- 
off between Cas9 editing efficiency, PAM compatibility and DNA specificity. 


The CRISPR-Cas9 system has facilitated widely used genome mani- 
pulation capabilities including targeted gene disruption’, transcrip- 
tional activation and repression®, epigenetic modification’, and direct 
conversion of a target base pair to a different base pair*® in a broad 
range of organisms and cell types®. CRISPR-Cas9 targets DNA in 
a manner that is programmed by an RNA (typically a single-guide 
RNA, or sgRNA’) that contains a spacer sequence complementary 
to the target DNA site, the protospacer. In addition to a protospacer 
that complements the sgRNA, a Cas9 target site must also contain 
a PAM sequence to support recognition by Cas9. The NGG PAM 
requirement of canonical SpCas9, which occurs on average only once 
in every 16 randomly chosen genomic loci, greatly limits the target- 
ing scope of Cas9 especially for applications that require precise Cas9 
positioning, such as base editing, which requires a PAM approximately 
13-17 nucleotides from the target base*®, and some forms of homology- 
directed repair, which are most efficient when DNA cleavage 
occurs roughly 10-20 base pairs away from a desired alteration®?. 

These requirements limit the fraction of gnomic DNA that can be 
targeted with CRISPR systems and highlight the need for more general 
genome-editing tools. 

To address this limitation, researchers have harnessed natural 
CRISPR nucleases with different PAM requirements and engineered 
existing systems to accept variants of naturally recognized PAMs. Other 
natural CRISPR nucleases shown to function efficiently in mammalian 
cells include Staphylococcus aureus Cas9 (SaCas9)'°, Acidaminococcus 
sp. Cpf1"!, Lachnospiraceae bacterium Cpf1"', Campylobacter jejuni 
Cas9!”, Streptococcus thermophilus Cas9'° and Neisseria meningitidis 
Cas9!4. None of these CRISPR nucleases, however, offers a PAM that 
occurs as frequently as that of SpCas9. Although CRISPR nucleases 
engineered to accept additional PAM sequences'*'® also expand the 
scope of genomic targets available for Cas9-mediated manipulation, 
many target sequences remain inaccessible. 

Here we used phage-assisted continuous evolution (PACE) to rapi- 
dly generate Cas9 variants that accept an expanded range of PAM 


sequences. During PACE, host Escherichia coli cells continuously dilute 
an evolving population of bacteriophages (selection phages). Because 
dilution occurs faster than cell division but slower than phage replica- 
tion, only the selection phages, and not the host cells, can accumulate 
mutations’. Each selection phage carries a gene to be evolved instead of 
a phage gene (gene III) that is required for the production of infectious 
progeny phage. Selection phages that contain desired gene variants trig- 
ger host-cell gene III expression from the accessory plasmid and the 
production of infectious selection phages that propagate the desired 
variants. Phages encoding inactive variants do not generate infectious 
progeny and are rapidly diluted out of the culture vessel (Fig. 1a). As 
phage replication can occur in as little as 10 min, PACE enables hun- 
dreds of generations of directed evolution to occur per week without 
researcher intervention'”-**. 


Evolution of Cas9 towards expanded PAM compatibility 
To link Cas9 DNA recognition to phage propagation during PACE, we 
developed a bacterial one-hybrid selection?””*”4 in which the selec- 
tion phage encodes a catalytically dead SpCas9 (dCas9) fused to the 
w subunit of bacterial RNA polymerase. When this fusion binds an 
sgRNA encoded by the accessory plasmid and a PAM and protospacer 
upstream of gene III in the accessory plasmid, recruitment RNA poly- 
merase causes gene III expression and phage propagation (Fig. 1b). We 
envisioned installing a library of all 64 possible NNN PAM sequences 
at the target protospacer in the accessory plasmid, so that selection 
phages encoding Cas9 variants with broader PAM compatibility would 
replicate in a larger fraction of host cells and thus experience a fitness 
advantage. 

We optimized the relationship between Cas9 DNA binding and gene 
expression (Extended Data Fig. la—c). These studies revealed that a 
fusion of the orientation N-w-dCas9-C, together with a simple Ala- 
Ala fusion linker and the placement of the protospacer on the reverse 
complement strand 45 base pairs upstream of the —35 box results in the 
strongest guide RNA-dependent gene expression activation (13-fold) 


1Merkin Institute of Transformative Technologies in Healthcare, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA. 2Howard Hughes Medical Institute, Harvard University, 
Cambridge, Massachusetts 02138, USA. ?Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts 02138, USA. 
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Figure 1 | PACE of Cas9 variants with broadened PAM compatibility. 

a, PACE takes place in a fixed-volume ‘lagoor’ that is continuously diluted 
with fresh host E. coli cells. Upon infection, selection phages (SP) that 
encode a Cas9 variant capable of binding the target PAM and protospacer 
on the accessory plasmid (AP) induce expression of gene III, resulting 

in infectious progeny phages that propagate the active Cas9 variant in 
subsequent host cells. pIII, attachment protein G3P (encoded by gene III). 
b, Representaion of a phage-infected host cell during PACE. The host 

cell carries the accessory plasmid, which links Cas9 target DNA binding 
to phage propagation, and the mutagenesis plasmid (MP), which 
increases mutagenesis during PACE. RNAP, RNA polymerase. c, d, The 
crystal structure of SpCas9 with the location of xCas9 mutations shown. 
e, Genotypes of some evolved xCas9 variants, coloured by evolution stage. 
See Supplementary Table 5 for the 95 xCas9 variant genotypes. 


(Extended Data Fig. la—c). Together, these results establish a linkage 
between Cas9 DNA-binding activity and gene expression in a selection 
system suitable for PACE. 

Using this selection, we first allowed a selection phage encoding 
the w-dCas9 fusion to self-optimize on host cells containing an acces- 
sory plasmid with a canonical NGG PAM, resulting in enrichment of 
an I12N mutation in the w subunit. Adding this single mutation to 
w-dCas9 boosted activation from 13-fold to over 100-fold (Extended 
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Data Fig. 1d), representing early-stage optimization of w-dCas9 during 
PACE before evolution for broadened PAM compatibility. 

To evolve Cas9 variants with expanded PAM compatibility we gene- 
rated three accessory plasmid libraries, each containing a different 
sgRNA (Supplementary Table 4) and a corresponding protospacer 
upstream of an NNN PAM library, in which N is an equimolar mixture 
of all four DNA bases. This design imposes selection pressure to recog- 
nize many different PAM sequences, as well as to maintain compatibility 
with different target DNA sequences. All three accessory plasmid 
libraries were introduced into host E. coli cells harbouring the mutagen- 
esis plasmid MP6’’. The resulting host cells were incubated overnight 
with selection phages containing w(I12N)-dCas9. This phage-assisted 
non-continuous evolution system!?”! preferentially replicates Cas9 
variants that bind a greater variety of PAM sequences, similar to PACE, 
but with lower stringency since there is no outflow of the phages. 

After 24 days of serial overnight propagation and 1:1,000 dilution, 
we isolated five Cas9 clones for sequencing and characterization 
(xCas9-1.0-xCas9-1.4). Notable recurring mutations include E480K, 
E543D and E1219V (Fig. le and Supplementary Table 5). E1219 is 
close in the SpCas9 crystal structure to R1333 and R1335 (Fig. 1c), 
two residues that are known to have a critical role in PAM recog- 
nition”. The mixture of phage from the final phage-assisted non- 
continuous evolution pool were further evolved for 72h using PACE 
on host cells containing the same accessory plasmid libraries harbour- 
ing NNN PAM sequences. Among the individual Cas9 clones emerg- 
ing from PACE (xCas9-2.0-xCas9-2.6), E480K, E543D and E1219V 
were present in all sequenced phages, along with additional mutations 
seen in multiple clones, including A262T, K294R, S409] and M694I 
(Fig. le and Supplementary Table 5). Finally, the resulting phages were 
continuously evolved using PACE for an additional 72h on host cells 
containing three protospacer-sgRNA pairs and HHH PAM libraries, 
in which H is A, C, or T, to favour Cas9 variants with activity on non- 
NGG PAMs. 

Fourteen resulting evolved Cas9 variants (xCas9-3.0-xCas9-3.13) 
containing consensus mutations (Fig. le and Supplementary Table 5) 
emerged from two apparent evolution trajectories. Although all phages 
shared E480K, E543D and E1219V core mutations, xCas9-3.0—xCas9- 
3.5 also all contained K294R and Q1256K mutations and xCas9-3.6— 
xCas9-3.13 all contained A262T, $4091 and M694I mutations, with 
some of the latter also containing R324L. The Cas9 crystal structure 
predicts that R324L, S409I and M6941 lie near the DNA-sgRNA 
interface (Fig. 1d) and could have a role in mediating DNA sequence 
recognition and the switching of Cas9 from the open to the closed con- 
formation upon target recognition”®”’. Because the entire Cas9 gene 
was subject to mutagenesis during PACE, the mechanism of xCas9 may 
differ from that of engineered Cas9 variants that primarily mutated 
DNA-contacting residues'*'®8, 

We characterized evolved xCas9 variants in several contexts. We 
first restored the catalytic residues D10 and H840 to test whether 
xCas9 nucleases can cleave DNA even though they were evolved only 
for DNA binding. The xCas9-3.0-xCas9-3.13 clones were tested in a 
PAM depletion assay!*'° in which they were given the opportunity to 
cleave a library of plasmids containing a protospacer and all possible 
NNN PAM sequences in an antibiotic-resistance gene in bacterial cells. 
Plasmid cleavage results in the loss of spectinomycin resistance. This 
PAM depletion assay revealed that xCas9-3.0-xCas9-3.3 and xCas9- 
3.5-xCas9-3.9 cleave DNA site with NG, NNG, GAA, GAT and CAA 
PAMs (Extended Data Fig. 2). The clone with the highest PAM deple- 
tion score, xCas9-3.7, depleted NG, NNG, GAA, GAT and CAA PAMs 
by at least 100-fold compared to the starting library, and xCas9-3.6 
showed the second highest average PAM depletion score. 


xCas9 activators and nucleases in human cells 

To test whether mutations evolved during PACE in bacteria are compa- 
tible with xCas9 function in mammalian cells, we characterized xCas9 
variants for their activity and PAM compatibility in human cells in 
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four contexts: transcriptional activation, genomic DNA cutting, cyti- 
dine base editing and adenine base editing. All guide RNAs used in 
this study were transcribed from a U6 promoter, and natively started 
with a G at the 5’ end to avoid possible losses in activity caused by 
a mismatched 5’ guide terminus’*!°”*?. To test for transcriptional 
activation, catalytically dead versions of xCas9 were fused to the 
transcriptional activator VP64—p65—Rta (dxCas9-VPR)*!. Plasmids 
encoding dxCas9-VPR, a green fluorescent protein (GFP) reporter 
downstream of a target protospacer and a corresponding sgRNA were 
co-transfected into human HEK293T cells*’. Target-gene transcrip- 
tional activation was measured by cellular GFP fluorescence after three 
days. Three different target-site PAM sets were tested: a single reporter 
with an NGG PAM, a reporter library containing a NNN PAM library 
anda reporter library containing a NNNNN PAM library. In addition, 
two different protospacer sequences, reporter 1 and reporter 2, were 
tested with their corresponding sgRNAs. 

Most early stage xCas9 variants outperformed wild-type SpCas9 
on sites with NGG PAMs, as well as with NNN and NNNNN PAM 
libraries (Extended Data Fig. 3a). For reporters 1 and 2, respectively, 
xCas9-3.7 achieved 2.8- and 1.5-fold higher mean fluorescence for 
the NGG PAM, 7.9- and 2.1-fold higher mean fluorescence for the 
NNN PAM library, and 5.2- and 1.7-fold higher mean fluorescence for 
the NNNNN PAM library when compared with SpCas9 (Extended 
Data Fig. 3b, c). The similar performance of xCas9-3.7 on NNN and 
NNNNN PAM libraries suggests that it did not evolve strong sequence 
preferences at nucleotides immediately downstream of the NNN PAM. 
The xCas9-3.6 variant showed similar results to those of xCas9-3.7 in 
this assay (Extended Data Fig. 3b, c). 

To dissect activity on individual PAM sequences, we tested dxCas9- 
VPR transcriptional activators on individual target sites containing 
each of the 64 possible three-nucleotide PAM sequences (Fig. 2a and 
Extended Data Figs 4-6) in HEK293T cells. Consistent with the PAM 
library results, dxCas9(3.7)-VPR showed broad improvements in tran- 
scriptional activity relative to dSpCas9-VPR across many individual 
non-NGG PAMs. Transcriptional activation by dxCas9(3.7)-VPR at 
sites containing NGT, NGA, NGC, NNG, GAA and GAT PAMs aver- 
aged 56-91% of the average activity of dxCas9(3.7)-VPR on the four 
NGG PAM sites (Fig. 2a and Extended Data Fig. 5). The performance 
of xCas9-3.6 transcriptional activators was similar to that of xCas9- 
3.7 (Extended Data Fig. 6). To test whether broadened transcriptional 
activation by xCas9 is limited to reporter plasmids that may be only 
partially chromatinized, we also tested the ability of dxCas9(3.7)-VPR 
to activate transcription of six endogenous genomic loci in human cells 
and observed 3.3-fold average improved activation of the two NGG 
PAM sites and 39-fold average improved activation among the three 
NGN PAM sites, but no improvement on the tested NNG site, relative 
to dSpCas9-VPR (Extended Data Fig. 5e). Overall, these results estab- 
lish that xCas9 is compatible with the dCas9-VPR architecture and can 
serve as a potent transcriptional activator in human cells at a substan- 
tially expanded set of PAMs. On the basis of their strong performance 
in the PAM depletion assay and as transcriptional activators, we chose 
xCas9-3.7 and xCas9-3.6 for further characterization. 

To test targeted genomic DNA cleavage in human cells, we expressed 
xCas9-3.7 and -3.6 nuclease in a HEK293T cell line with a genomi- 
cally integrated GFP gene and measured the loss of GFP fluorescence 
reflecting DNA cleavage and indel-mediated disruption of the tar- 
get site. There are two NGG PAM sites and three NGT sites present 
in the GFP sequence, as well as individual NGC, NGA, GAT, NCG 
and NTG sites. All of these PAM sequences were tested with SpCas9, 
xCas9-3.7 and xCas9-3.6. For the NGG PAM sites, xCas9-3.7 modestly 
outperformed SpCas9, resulting in 46 + 2.0% compared to 33 +3.4% 
GFP disruption for SpCas9 (mean +s.d. of three independent repli- 
cates, Fig. 2b). For all tested non- NGG PAM sites, xCas9-3.7 showed 
substantially higher (1.6- to 5.1-fold) average apparent cleavage 
activity than SpCas9 (Fig. 2b). At all tested sites, xCas9-3.6 showed 
GFP disruption percentages that were similar to or slightly lower than 
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those of xCas9-3.7 (Extended Data Fig. 7a). Neither xCas9-3.7 nor 
xCas9-3.6 increased GFP loss relative to SpCas9 for either NNG PAM 
site tested, suggesting that the transcriptional activation by dxCas9- 
VPR observed at some NNG PAM sites, and the strong NNG PAM 
signal in the bacterial PAM depletion assay, do not necessarily trans- 
late to DNA cleavage at all NNG sites in mammalian cells. Target-site 
dependence is also well-known for SpCas9°”. 

To further characterize DNA cleavage by xCas9 variants in human 
cells, we targeted endogenous genomic sites in HEK293T cells and 
measured indel formation by high-throughput sequencing (HTS). 
Twenty endogenous sites were tested covering four NGG PAM sites, all 
twelve possible NGT, NGC and NGA PAMs, and GAT, GAA and CAA 
PAMs (Fig. 2c). On the four NGG PAM sites tested, xCas9-3.7 showed 
comparable activity to SpCas9, averaging 41 + 6.4% indels compared 
to 41+ 6.0% for SpCas9 (Fig. 2c). All four NGT PAM sites showed 
much higher indel formation with xCas9-3.7 than with SpCas9, aver- 
aging 38 + 4.1% indels compared to 8.6 + 1.5% for SpCas9, a 4.5-fold 
increase. The four NGA PAM sites averaged 32 + 2.4% indels for 
xCas9-3.7 and 20 + 2.7% for SpCas9, a 1.6-fold increase, consistent 
with previous reports that NGA can serve as a secondary PAM for 
SpCas9*°, Although indel frequencies at the endogenous NGC PAM 
sites were more variable, ranging from 4.8 to 31%, xCas9-3.7 aver- 
aged 13 + 0.90% indels compared to 6.3 + 0.77% for SpCas9, a 2.1-fold 
increase. Among the three GAA and GAT sites tested, SpCas9 showed 
virtually no activity, averaging 1.4 + 1.3% indel formation, whereas 
xCas9-3.7 averaged 7.2 + 2.8% indel formation, a 5.2-fold increase. 
For all sites tested the xCas9-3.6 variant showed indel frequencies that 
were similar to or slightly lower than those of xCas9-3.7 (Extended 
Data Fig. 7b). Negative control experiments lacking sgRNA plasmids 
resulted in no indels above background (Extended Data Fig. 8). Taken 
together, these results indicate that xCas9-3.7 nuclease mediates tar- 
get-gene disruption at NGG PAM sites with efficiencies that are com- 
parable to wild-type SpCas9, but cleaves NG, GAA and GAT PAM 
sites with substantially higher efficiencies than SpCas9. The greater 
PAM-dependent and protospacer-dependent variability of xCas9 
nuclease-mediated gene disruption relative to transcriptional activa- 
tion (Fig. 2 and Extended Data Figs 5-7) may reflect more extensive 
requirements for DNA cleavage than DNA binding”, or differences 
in the chromatin state of plasmid (Fig. 2a and Extended Data Figs 4, 
5a-d, 6) versus genomic targets (Fig. 2b, c and Extended Data 
Figs 5e, 7). 


Base editing by xCas9 variants in human cells 

Base editing is a newer genome editing approach that uses a catalyti- 
cally impaired Cas9 fused to a natural or laboratory-evolved nucleobase 
deaminase enzyme and, in some cases, a DNA glycosylase inhibitor to 
directly convert a target CeG to TeA, or a target AeT to GeC, without 
introducing double-stranded DNA breaks or requiring homology- 
directed repair****. The suitability of a target site for base editing is 
highly dependent on the presence of a suitably positioned PAM, which 
must exist within a narrow window downstream of the target base pair 
(typically 15 +2 nucleotides). The broad PAM compatibility of xCas9 
variants thus has the potential to expand the DNA targeting scope of 
base editors (Fig. 3c). 

To evaluate CeG-to-TeA base-editing activity of xCas9 variants, we 
substituted SpCas9 with xCas9-3.7 and -3.6 in the third-generation 
(BE3) base editor architecture’. We separately transfected plasmids 
encoding xCas9(3.7)-BE3, xCas9(3.6)-BE3 and SpCas9-BE3 into 
mammalian cells to compare editing efficiency on all 20 sites tested 
above for endogenous genomic DNA cleavage (Fig. 3a). At the four 
NGG PAM target sites tested, xCas9(3.7)-BE3 averaged 37 + 10% 
CeG-to-TeA conversion, whereas SpCas9-BE3 averaged 28 + 5.2% 
(Fig. 3a). At the NG, GAA and GAT PAM sites tested, xCas9(3.7)-BE3 
resulted in substantially improved base editing, averaging 24 + 5.4% 
editing at NGT PAM sites, an 9.5-fold increase compared with that of 
SpCas9; 16 +3.5% editing at NGA sites, a 3.5-fold increase compared 
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Figure 2 | Transcriptional 
activation and genomic DNA 
cleavage by evolved xCas9-3.7 in 
human cells. a, Transcriptional 
activation by dSpCas9-VPR and 
dxCas9(3.7)-VPR targeting GFP 
reporter plasmids containing the 
same protospacer but different 
PAM sites in HEK293T cells. AU, 
arbitrary units. b, Genomic DNA 
cleavage by SpCas9 or xCas9-3.7 
in HEK293-GFP cells that have 

a genomically integrated GFP 
reporter gene. After 5 days, the 
cells were analysed for loss of GFP 
fluorescence by flow cytometry. 

c, DNA cleavage of endogenous 
genomic DNA sites with NGG 
and non-NGG PAMs by SpCas9 
and xCas9-3.7 in HEK293T cells. 
Indel rates were measured by HTS 
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with SpCas9; 6.2 + 0.34% editing at NGC sites, a 13-fold increase over 
SpCas9; 10 + 0.75% editing on the GAA PAM site, a more than 50-fold 
increase over SpCas9; and 12 + 1.5% editing on the GAT sites, a greater 
than 100-fold increase compared with SpCas9 (Fig. 3a). The base edit- 
ing efficiencies of xCas9(3.6)-BE3 were comparable to, or slightly worse 
than, those of xCas9(3.7)-BE3 (Extended Data Fig. 7c). We also tested 


60 | NATURE | VOL 556 | 5 APRIL 2018 


NGA GAN NAA 


cytosine base editing at an additional 15 endogenous genomic sites 
within the FANCF gene and observed similar large improvements for 
xCas9(3.7)-BE3 over SpCas9-BE3 (Extended Data Fig. 9a). Overall, 
these results indicate that xCas9 variants are compatible with the 
BE3 architecture enabling cytidine base editing of target sites that 
cannot be accessed by SpCas9-BE3 or, with the exception of NGA PAM 
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Figure 3 | Cytidine and adenine 
base editing by xCas9. a, CeG- 
to-TeA conversion frequencies at 
the most efficiently edited base 

for 20 endogenous genomic loci 

in HEK293T cells 3 days after 
plasmid transfection. b, AeT-to- 
GeC conversion frequencies at the 
most efficiently edited base for 
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HEK293T cells 5 days after plasmid 
transfection. a, b, Data are mean and 
s.d. of three biologically independent 
samples. See Supplementary Table 
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sites*4, by any other previously reported base editors. We also tested 
xCas9-3.7 in the BE4 architecture that was designed to reduce undesired 
byproducts*». In this system, fewer indels and higher product puri- 
ties were observed, although editing efficiencies were slightly lower 
(Extended Data Fig. 9b-d). 

The recent development of an adenine base editor (ABE) enables 
programmable installation of AeT-to-GeC mutations’. No ABEs have 
been reported that can target non- NGG PAM sites, limiting its tar- 
geting scope. We replaced SpCas9 in ABE 7.10 (ref. 5) with xCas9-3.7 
and -3.6, and assayed the resulting xCas9-ABEs in HEK293T cells at 
the seven endogenous genomic sites tested above that contain an A 
in the targeting window of ABE (positions 4-8, counting the PAM as 
positions 21-23). At all seven of these sites, xCas9(3.7)-ABE resulted in 
higher base-editing efficiencies than the original SpCas9-ABE (Fig. 3b). 
Average base-editing efficiency at the NGG PAM site tested increased 
from 48 + 2.1% to 69 + 3.7%. At the GAT PAM site tested, xCas9(3.7)- 
ABE resulted in 16 + 1.5% base editing, whereas SpCas9-ABE yielded 
no detectable editing (at most 0.1%), representing a more than 100-fold 
increase. On the two NGC and three NGA sites tested, xCas9(3.7)-ABE 
averaged 21 + 2.5% and 43 + 1.5% base editing, respectively, whereas 
SpCas9-ABE averaged 7.0 + 1.3% and 22 + 1.2%, respectively. Base 
editing by xCas9(3.7)-ABE was comparable to or higher than that of 
xCas9(3.6)-ABE (Extended Data Fig. 7d). Collectively, these results 
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establish that xCas9-ABE mediates adenine base editing at sites that 
cannot currently be accessed. 


Improved DNA specificity of xCas9 in human cells 
Because PAM recognition is a crucial component of Cas9 DNA spec- 
ificity’’, the substantially broadened PAM compatibility of xCas9 
proteins would be expected to increase their off-target activity”®*®. 
Indeed, the engineered S. aureus KKH-SaCas9, which accepts an 
NNNRERT PAM instead of the native NNGRRT SaCas9 PAM, exhib- 
its similar or higher levels of off-target editing than SaCas9!°. Most 
of the xCas9 mutations are close to the PAM or to the DNA-sgRNA 
interface (Fig. 1), raising the possibility that these mutations might 
alter the degree to which mismatches between the spacer and proto- 
spacer impede productive DNA binding or editing. Previous studies 
have demonstrated that some mutations near the Cas9-DNA interface 
can reduce off-target activity, in some cases without affecting on-target 
modification efficiency***”*8, 

To test the off-target activity of xCas9 variants, we performed 
genome-wide, unbiased identification of double-strand breaks ena- 
bled by sequencing (GUIDE-seq), an off-target analysis method”®, 
of xCas9-3.7, xCas9-3.6 and SpCas9 in HEK293T and U20OS cells. 
Notably, for all five endogenous genomic NGG PAM sites tested in 
HEK293T cells and for both NGG PAM sites tested in U2OS cells, 
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Figure 4 | Off-target editing 
analysis of xCas9. a, GUIDE-seq”® 
was performed on SpCas9 and 
xCas9-3.7 nucleases. Six endogenous 
genomic sites with NGG PAMs 
were tested in HEK293T or U2O0S 
cells. The percentage of off-target 
sequencing reads relative to total 
reads are shown. b-d, All GUIDE- 
seq on-target reads (indicated by 
an asterisk) and off-target reads 
for three sites in HEK293T cells 
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GUIDE-seq analysis revealed that xCas9-3.7 and -3.6 resulted in 
much lower off-target activity than SpCas9, as reflected by both 
the number of detected off-target sites as well as the total modifica- 
tion frequency at each detected off-target site (Fig. 4 and Extended 
Data Fig. 10). For example, at the EMX1 target site in HEK293T 
cells, SpCas9 showed 5,649 on-target reads and a total of 1,132 off- 
target reads, whereas xCas9 3.7 showed 6,874 on-target reads but no 
off-target reads (Fig. 4b). In U2OS cells, the off-target to on-target 
ratio for the same site was 0.65 for SpCas9 with 6,328 on-target reads, 
and 0.015 for xCas9-3.7 with 22,539 on-target reads, representing a 
43-fold reduction in off-target modification (Extended Data Fig. 10c). 
Likewise, for HEK sites 1, 2 and 3, xCas9-3.7 resulted in at least 100- 
fold lower off-target to on-target modification ratios (0.023, less 
than 0.001 and 0.028, respectively) than those of SpCas9 (0.28, 0.14 
and 0.33, respectively) (Fig. 4 and Extended Data Fig. 10). For the 
known highly promiscuous target sites HEK site 4 and VEGFA”, the 
off-target to on-target ratios were 9.4 and 2.0, respectively, for SpCas9, 
but only 1.0 and 0.48 for xCas9-3.7, a 4.2- to 9.4-fold improvement 
(Extended Data Fig. 10). We observed these large improvements in 
DNA specificity for xCas9-3.7 and -3.6 even though xCas9 variants 
showed a much broader range of PAM sequences among detected 
off-target sites (Fig. 4 and Extended Data Fig. 10). The GUIDE-seq 
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results were verified by HTS of many individual on-target and off- 
target sites from the genomic DNA of treated cells (Extended Data 
Fig. 11). 

We also evaluated the off-target DNA specificity of both SpCas9 and 
xCas9-3.7 at two non-NGG PAM (GAA and CGT) sites in HEK293T 
cells. As expected, GUIDE-seq did not yield any on-target reads for 
SpCas9 at either of these non- NGG PAM sites, whereas xCas9-3.7 had 
3,627 on-target reads for the GAA PAM site and 3,055 on-target reads 
for the CGT PAM site (Fig. 4e, f). Notably, neither site exhibited any 
detected off-target GUIDE-seq reads for xCas9-3.7, although potential 
off-target reads were detected for SpCas9 (Fig. 4e, f). Collectively, these 
findings reveal that xCas9-3.7 and -3.6 offer greatly reduced off-target 
activity compared with wild-type SpCas9, despite their broader PAM 
compatibility. These results also establish that there is no trade-off 
between Cas9-mediated editing efficiency, PAM compatibility and 
DNA specificity, a key finding as natural and engineered genome- 
editing agents advance into widespread applications including human 
clinical trials. 

These results, together with the success of multiple independent 
efforts to create high-fidelity Cas9 variants”°?”"*, suggest that the DNA 
promiscuity of wild-type Cas9, which probably evolved to impede viral 
evasion, can be overcome by protein engineering or evolution. That 
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xCas9 exhibits much higher DNA specificity than SpCas9 even though 
it was not explicitly selected for this property suggests that the off- 
target activity of wild-type SpCas9 may lie at a narrow fitness peak 
that is suitable for defending the much smaller bacterial genome but 
not optimal for genome editing in mammalian cells. These observa- 
tions are therefore consistent with a model in which native SpCas9 is 
poised to become more specific, rather than less specific, upon suffi- 
cient mutation. 

To our knowledge, the targeting scope of xCas9 is the broadest 
among Cas9 variants known to function efficiently in mammalian cells. 
Evolved xCas9 variants are also the first to offer improvements in tar- 
geting scope, editing efficiency and DNA specificity in a single entity 
relative to wild-type SpCas9. Although the efficacy of xCas9 on non- 
NGG PAMs varies based on application (transcriptional activation, 
DNA cleavage or base editing) and on target site, the ability to access 
some NG, GAA and GAT PAM sequences greatly expands the breadth 
of targets available for site-sensitive genome editing applications. 
Compared to SpCas9-BE3, xCas9(3.7)-BE3 increases the percentage 
of the 4,422 pathogenic single-nucleotide polymorphisms (SNPs) in 
the ClinVar database*? that could, in principle, be targeted by CeG-to- 
TeA base editing from 26% to 73% (Fig. 3c). Likewise, xCas9(3.7)-ABE 
increases the fraction of the 14,969 pathogenic SNPs in Clin Var that 
could be targeted by AeT-to-GeC base editing from 28% to 71% 
(Fig. 3d). We anticipate that xCas9 and additional CRISPR enzyme 
variants with broadened PAM compatibilities may also expand the 
scope of other forms of nucleic acid editing, CRISPR-based screens 
and epigenetic modification. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized and the investigators were not blinded to allocation during 
experiments and outcome assessment. 

General methods and cloning. DNA sequences used in this work are listed in 
the Supplementary Information. PCR was performed using Q5 Hot Start High- 
Fidelity DNA Polymerase (New England Biolabs) or Phusion U Green Multiplex 
PCR Master Mix (Thermo Fisher Scientific). PACE plasmids and phages were 
constructed by USER cloning or Gibson cloning (New England Biolabs). Cas9 
genes and plasmid backbones for PACE were obtained from previously reported 
plasmids’ that are available from Addgene. Plasmids encoding dxCas9-VPR*!, 
xCas9 nucleases, xCas9-BE3*, xCas9-BE4* and xCas9- ABE? were constructed by 
replacing SpCas9 using Gibson cloning. Plasmids for sgRNA expression were con- 
structed using one-piece blunt-end ligation of a PCR product containing a variable 
20-nucleotide sequence corresponding to the desired sgRNA targeted site. Primers 
and templates used in the synthesis of all sgRNA plasmids used in this work are 
listed in Supplementary Tables 8, 9, 11, 12, 15 and 16. PCR was performed using Q5 
Hot Start High-Fidelity Polymerase (New England Biolabs) with phosphorylated 
primers and the plasmid pFYF1320 (sgRNA expression plasmid with a spacer for 
targeting enhanced GFP (eGFP)) as a template according to the manufacturer's 
instructions. PCR products were analysed by agarose gel electrophoresis, the 
band of the expected molecular weight was excised, and the DNA was extracted 
using a Zymoclean Gel DNA Recovery Kit (Zymo Research) and ligated using T4 
DNA Ligase (New England Biolabs) according to the manufacturer's instructions. 
DNA vector amplification was carried out using Mach1 competent cells (Thermo 
Fisher Scientific). All mammalian ABE constructs, sgRNA plasmids and bacterial 
constructs were transformed and stored as glycerol stocks at —80°C in Mach1 
T1® Competent Cells (Thermo Fisher Scientific), which carry a recA mutation. 
Molecular biology grade Hyclone water (GE Healthcare Life Sciences) was used 
in all assays and PCR reactions. All vectors used in evolution experiments and 
mammalian cell assays were purified using ZymoPURE Plasmid Midiprep (Zymo 
Research), which includes endotoxin removal. Antibiotics were purchased from 
Gold Biotechnology. 

Cell culture. HEK293T (ATCC CRL-3216) and U20S (ATCC-HTB-96) cells were 
maintained in DMEM plus GlutaMax (Thermo Fisher Scientific) supplemented 
with 10% (v/v) fetal bovine serum (FBS) at 37°C with 5% CO). Cell lines tested 
negative for mycoplasmid contamination. 

Transfections. HEK293T cells were seeded on 48-well poly-p-lysine-coated 
BioCoat plates (Corning) and transfected at approximately 85% confluency. For 
genomic DNA cutting or base editing, 750 ng of Cas9 or BE3 and 250 ng of sgRNA 
expression plasmids were transfected using 1.5 1l of Lipofectamine 2000 (Thermo 
Fisher Scientific) per well according to the manufacturer’s protocol. For GFP acti- 
vation, 200 ng of dCas9-VPR plasmid, 50 ng of sgRNA expression plasmid, 60 ng 
of GFP reporter plasmid and 30 ng of near-infrared fluorescent protein (iRFP) 
expression plasmid were transfected using 1.5 11 Lipofectamine 2000 (Thermo 
Fisher Scientific) per well according to the manufacturer's protocol. Endogenous 
gene activation was performed similarly but with 200 ng of dCas9-VPR plasmid 
and 50 ng of sgRNA expression plasmid only. 

GFP transcriptional activation assay. Transfected HEK293T cells were trypsin- 
ized and resuspended in DMEM plus GlutaMax (Thermo Fisher Scientific) sup- 
plemented with 10% (v/v) FBS. The cells were kept on ice and flow cytometry 
was performed using a LSRFortessa from BD Biosciences. Events were gated for 
iRFP-positive cells to analyse transfected cells. The percentage of GFP-positive cells 
and the intensity of GFP fluorescence from each cell was collected. 

RNA expression quantification for endogenous transcriptional activation assay. 
RNA was extracted from HEK293T cells using the Quick-RNA Plus Kit (Zymo 
Research). cDNA was synthesized using the iScript cDNA Synthesis Kit (Bio-Rad) 
and quantitative PCR (qPCR) was performed on a Bio-Rad CFX96 Real-Time PCR 
Detection System using Q5 Polymerase (New England Biolabs) and SYBR Green 
(Lonza). Primers for qPCR are listed in Supplementary Table 10. 

PAM depletion assay. Electrocompetent NEB 10-beta cells (New England Biolabs) 
were electroporated with two plasmids. The first plasmid expresses Cas9 (induc- 
ible with anhydrotetracycline, ATc), the sgRNA (inducible with arabinose) and a 
spectinomycin-resistance gene. The second plasmid contains the target protospacer 
and a kanamycin-resistance gene. After incubation with SOC outgrowth medium 
(New England Biolabs) for 1h the bacteria were plated on agar plates containing 
both spectinomycin and kanamycin along with ATc and arabinose inducers. After 


incubating overnight, the bacterial cells on the agar plates were scraped and the 
plasmids extracted using the ZymoPURE Plasmid Midiprep Kit (Zymo Research). 
The resulting post-selection DNA included all of the protospacer plasmids not 
cleaved by Cas9. The same region around the protospacer in the pre-selection 
library and the post-selection DNA was then amplified separately using NEBNext 
High-Fidelity PCR Polymerase (New England Biolabs) with the flanking HTS 
primer pairs that are listed in the Supplementary Table 7. The Illumina barcoding 
PCR reaction was assembled with NEBNext High-Fidelity PCR Polymerase. PCR 
products were purified by electrophoresis with a 2% agarose gel using a QlAquick 
Gel Extraction Kit, eluting with 1511 of water. DNA concentration was quanti- 
fied with the KAPA Library Quantification Kit-Ilumina (KAPA Biosystems) and 
sequenced on an Illumina MiSeq instrument according to the manufacturer’s 
protocols. 

High-throughput DNA sequencing of genomic DNA samples. Transfected cells 
were harvested after 3 days (BE3 and BE4) or 5 days (DNA cutting and ABE). 
Medium was removed and cells were washed with 1 x PBS solution (Thermo Fisher 
Scientific). Genomic DNA was extracted by addition of 10011 freshly prepared lysis 
buffer (10 mM Tris-HCl, pH 7.0, 0.05% SDS, 25 1g ml’ proteinase K (Thermo 
Fisher Scientific)) directly into each well of the tissue culture plate. The plate was 
incubated at 37°C for 1 h. The genomic DNA mixture was transferred to a 96-well 
PCR plate and incubated at 80°C for 15 min to denature enzymes. Genomic regions 
of interest were amplified by PCR with flanking HTS primer pairs that are listed 
in Supplementary Table 13. Each 25,11 PCR reaction was assembled with Phusion 
Hot Start Il High-Fidelity DNA Polymerase (Thermo Fisher Scientific) according 
to the manufacturer’s instructions using 1.0\1M forward and reverse primer and 
1,1 of genomic DNA extract. PCR reactions were carried out under the following 
conditions: 95°C for 3 min, then 30 cycles of 98°C for 30s, 60°C for 20s and 72°C 
for 1 min, followed by a final 72°C extension for 5 min. PCR products were verified 
by comparison to DNA standards (1-kb Plus DNA Ladder) on a 2% agarose gel 
with ethidium bromide. Each 25-1 lumina barcoding PCR reaction was assem- 
bled with Phusion DNA polymerase according to the manufacturer’s instructions 
using 0.5|1M unique forward and reverse Illumina barcoding primer pair and 
1 of unpurified genomic amplification PCR reaction mixture. The barcoding 
reactions were carried out as follows: 98°C for 2 min, then 8 cycles of 98°C for 12s, 
61°C for 25s and 72°C for 30s, followed by a final 72°C extension for 1.5 min. PCR 
products were purified by electrophoresis with a 2% agarose gel using a QIAquick 
Gel Extraction Kit, eluting with 1511 of water. DNA concentration was deter- 
mined with the KAPA Library Quantification Kit-Ilumina (KAPA Biosystems) 
and sequenced on an Illumina MiSeq instrument according to the manufacturer's 
protocols. Analysis was carried out using previously published Matlab code? that 
is provided in Supplementary Notes 1 and 2. 

Analysis of human disease-associated mutations in ClinVar database. 
Bioinformatic analysis of the Clin Var database was carried out in a manner similar 
to previously described analysis*!. The code is provided in Supplementary Note 3. 
GUIDE-seq. HEK293T cells were transfected with 750 ng of the Cas9 plasmid, 
250 ng of the sgRNA plasmid and 20 pmol of GUIDE-seq dsODN. U20S cells 
were transfected with 750 ng of the Cas9 plasmid, 250 ng of the sgRNA plasmid 
and 100 pmol of GUIDE-seq dsODN. For both cell types, 2011 of solution SE 
(Lonza) was used along with a Lonza Nucleofector 4-D. Program CM-137 was 
used for HEK293T cells and program DN-100 was used for U2OS cells. Genomic 
DNA was extracted using the Quick-DNA Miniprep Plus Kit (Zymo Research) 
following the manufacturer’s protocol. The DNA was sheared to an average of 
500 base pairs using a Covaris $220 focused ultrasonicator as previously 
described”’. End repair, dA-tailing, adaptor ligation, tag-specific PCR1 and 
tag-specific PCR2 were carried out using the primers and methods described 
previously”*. DNA concentration was quantified with the KAPA Library 
Quantification Kit-Illumina (KAPA Biosystems) and sequenced on an Illumina 
MiSeq instrument according to the manufacturer's protocols. Analysis was carried 
out using previously published Python code”. 

Data availability. High-throughput sequencing data have been deposited in the 
NCBI Sequence Read Archive database under accession code SRP130166. GUIDE- 
seq sequencing files listed in Supplementary Table 17. Plasmids encoding the 
xCas9-3.7 and -3.6 transcriptional activators, nucleases, BE3, BE4 and ABE have 
been deposited with Addgene. 


40. Crooks, G. E., Hon, G., Chandonia, J. M. & Brenner, S. E. WebLogo: a sequence 
logo generator. Genome Res. 14, 1188-1190 (2004). 
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Extended Data Figure 1 | Optimization of Cas9 PACE. Luciferase 
expression in E. coli was used as a proxy of gene III expression levels 
during efforts to link Cas9 binding to gene expression for PACE. 

a, b, Seven guide RNAs targeting the luciferase reporter (G1-G7’, see 
Supplementary Table 1), as well as a scrambled guide RNA negative 
control (GO) were tested without dCas9 (white bars) and with w-dCas9 
(a) or dCas9-w (b) fusions (grey bars). c, Tests of seven different linkers 
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between w and dCas9. See Supplementary Table 2 for linker sequences. 

d, Evolution of w-dCas9 on an NGG PAM site in PACE yielded variants 
(PACE1, PACE2 and PACE3) that were tested in comparison with 
canonical wild-type (wt) w-dCas9, w tethered to PACE1 dCas9 and the 
112N w mutant tethered to canonical wild-type dCas9. a-d, Data are mean 
and s.d. of three biologically independent samples. 
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Extended Data Figure 2 | PAM profiling of xCas9 variants. a, In 
separate experiments, a plasmid library containing a protospacer with all 
possible NNN PAM sequences and a spectinomycin-resistance gene was 
electroporated into E. coli along with a plasmid expressing SpCas9 or the 
xCas9 variant shown. PAMs that are cleaved are depleted from the library 
when plated on medium containing spectinomycin. HTS of the library 
before and after selection enables quantification of the change in library 


composition, resulting in a sequence logo“ for the PAM preference of 
SpCas9 (left) and xCas9-3.7 (right). b, c, PAM depletion scores of Cas9 
variants from spectinomycin selection in E. coli, calculated as described 
previously*®, with 1.0 representing complete cleavage of that PAM 
sequence. Scores for NGN, NNG, GAA, GAT and CAA are shown in b and 
the rest of the PAM sequences are shown in c. 
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Extended Data Figure 4 | Transcriptional activation with xCas9-2.0. 
a-d, The transcriptional activator dxCas9(2.0)-VPR was tested on 
the R1 protospacer (Extended Data Fig. 3 and Supplementary Table 8) 
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Extended Data Figure 5 | Transcriptional activation with xCas9-3.7 on MIAT or RHOXF2 at six total sites. RNA expression as measured by qPCR 


all 64 NNN PAM sites and endogenous gene activation in human cells. with reverse transcription (RT-qPCR) was compared to background 
a-d, The transcriptional activator dxCas9(3.7)—VPR was tested on the R1 expression levels for each gene (measured in the control with no sgRNA) 
protospacer (Extended Data Fig. 3 and Supplementary Table 8) with each and was normalized to ACTB expression. a—e, Data are mean and s.d. 
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Extended Data Figure 7 | Genomic DNA cleavage and base editing by 
evolved xCas9-3.6. a, Genomic DNA cleavage by SpCas9 or xCas9-3.6 
(transfected as plasmids), in HEK293-GFP cells containing a genomically 
integrated GFP gene. After 5 days, the cells were analysed for loss of GFP 
fluorescence by flow cytometry. Sequences for all target sites are listed 

in Supplementary Table 11. b, DNA cleavage of endogenous genomic 
DNA sites with a variety of NGG and non-NGG PAMs by SpCas9 and 
xCas9-3.6 in HEK293T cells. Indel rates were measured using HTS 5 days 
after plasmid transfection. Sequences for all target sites are listed in 
Supplementary Table 12. c, CeG-to-TeA base editing in HEK293T cells 
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by SpCas9-BE3 or xCas9(3.6)—BE3 was tested at 20 sites containing NG, 
GAA, GAT, or CAA PAM sites. The CeG-to-TeA conversion frequency, 
ascertained using HTS, at the most efficiently edited base 3 days after 
plasmid transfection is shown. d, Of the 20 sites in c, seven contained an 

A in the canonical window for ABE editing® and were tested for AeT-to- 
GC base editing by SpCas9-ABE and xCas9(3.6)—ABE. The AeT-to-GeC 
conversion frequency, ascertained using HTS, at the most efficiently edited 
base 5 days after plasmid transfection is shown. a—d, Data are mean and 
s.d. of three biologically independent samples. Complete HTS results 
across the protospacer are provided in Supplementary Table 14. 
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Extended Data Figure 9 | Cytidine base editing at 15 additional targeting sites with NGG PAMs (c) and non-NGG PAMs (d). e, f, Product 


genomic sites and xCas9 base editing with the BE4 architecture. a, Base distribution among edited DNA sequence reads (reads in which the target 
editing by SpCas9-BE3 and xCas9(3.7)-BE3 at 15 sites within the FANCF C is mutated) following treatment with BE3 or BE4 variants targeting 


gene in HEK293T cells. The CeG-to-TeA conversion frequency at the sites with NGG PAMs (e) and non-NGG PAMs (f). Because SpCas9 has 
most efficiently edited base 3 days after plasmid transfection is shown. minimal activity on non-NGG PAM sites, only xCas9(3.7)-BE3 and 

b, Test of xCas9-3.7 in the BE4 architecture** on the same sites tested in xCas9(3.7)-BE4 data are compared on non-NGG PAM sites. a-f, Data are 
Fig. 3. The CeG-to-TeA conversion frequency in HEK293T cells at the mean and s.d. of three biologically independent samples. Target sites are 
most efficiently edited base 3 days after plasmid transfection is shown. listed in Supplementary Table 12. 


c, d, Indel frequency following treatment with BE3 or BE4 variants 
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Extended Data Figure 10 | Additional characterization of xCas9-3.7 SpCas9 and xCas9-3.7 in HEK293T and U20OS cells were also tested 
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in U2OS cells (c, d) were analysed after treatment with SpCas9 and xCas9-__— Table 15. 
3.7. e, All six GUIDE-seq sites with an NGG PAM that were tested with 
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Extended Data Figure 11 | Validation by high-throughput sequencing 
of GUIDE-seq results. ad, The most frequent off-target sites identified 
using GUIDE-seq were verified by HTS of genomic DNA following 
treatment of HEK293T cells with SpCas9 or xCas9-3.7 (a, b), or following 
treatment with SpCas9 or xCas9-3.6 (c, d). New sites with non-NGG 


PAMs that were identified as off-target sites of the xCas9 proteins were 
also analysed in a-d. a—d, Data are mean and s.d. of three biologically 
independent samples. Target sequences are listed in Supplementary 
Table 16. 
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Architecture of the human GATORI and 
GATOR1-Rag GTPases complexes 


Kuang Shen!?*4*, Rick K. Huang>*, Edward J. Brignole*®, Kendall J. Condon!?**, Max L. Valenstein!*34, 
Lynne Chantranupong!*34+, Aimaiti Bomaliyamu!, Abigail Choe!, Chuan Hong°, Zhiheng Yu & David M. Sabatini>?*4 


Nutrients, such as amino acids and glucose, signal through the Rag GTPases to activate mTORC1. The GATOR! protein 
complex—comprising DEPDC5, NPRL2 and NPRL3—regulates the Rag GTPases as a GTPase -activating protein (GAP) for 
RAGA; loss of GATOR] desensitizes mTORC1 signalling to nutrient starvation. GATOR1 components have no sequence 
homology to other proteins, so the function of GATOR] at the molecular level is currently unknown. Here we used 
cryo-electron microscopy to solve structures of GATOR1 and GATORI-Rag GTPases complexes. GATORI1 adopts an 
extended architecture with a cavity in the middle; NPRL2 links DEPDC5 and NPRL3, and DEPDC5 contacts the Rag 
GTPase heterodimer. Biochemical analyses reveal that our GATOR1-Rag GTPases structure is inhibitory, and that at least 
two binding modes must exist between the Rag GTPases and GATORI. Direct interaction of DEPDC5 with RAGA inhibits 
GATOR1-mediated stimulation of GTP hydrolysis by RAGA, whereas weaker interactions between the NPRL2-NPRL3 
heterodimer and RAGA execute GAP activity. These data reveal the structure of a component of the nutrient-sensing 
mTORCI pathway and a non-canonical interaction between a GAP and its substrate GTPase. 


The mTORCI pathway is a central regulator of cell growth'°. Nutrients 
signal to mTORCI] through the heterodimeric Rag GTPases (RAGA or 
RAGB bound to RAGC or RAGD)*®. When nutrients are abundant, 
RAGA binds GTP and RAGC binds GDP and this complex recruits 
mTORCI to the lysosomal surface!, where RHEB stimulates the kinase 
activity of mMTORC1!!’*. Upon nutrient starvation, the Rag GTPases 
adopt the opposite nucleotide loading state and cannot bind mTORC1, 
which becomes inhibited)”. 

The intrinsic GTP hydrolysis rates of the Rag GTPases are slow’, 
posing a problem for quickly altering the nucleotide state when nutri- 
ent levels change. Two GAP complexes, GATOR1!®!9 and FLCN- 
FNIP2””!, have been discovered that stimulate GTP hydrolysis by 
RAGA or RAGB and RAGC or RAGD, respectively. Both GATORI and 
FLCN-FNIP2 are deregulated in human disease, with loss-of-function 
mutations in GATOR1 being a frequent cause of familial epilepsy?” 

GATORI has three stably interacting subunits: DEPDC5, NPRL2 and 
NPRL3. Despite its central role in mTORCI signalling'*!*4, there is 
currently an almost complete lack of structural information about this 
complex. Protein structure prediction software, such as I-TASSER” and 
Jpred®, shows that all three subunits have low primary sequence simila- 
rity to other proteins and as a consequence have poorly defined domains. 
The only domains in GATOR] with orthologous structures are two 
longin domains’’, one each at the N terminus of NPRL2 and NPRL3, 
and a DEP (Dishevelled, EGL-10 and pleckstrin) domain in DEPDC5. 
Here we used cryo-electron microscopy (cryo-EM) to solve the structure 
of GATORI on its own and in complex with the Rag GTPases. 


Structural determination of GATORI complexes 

To generate GATORI for structural studies we co-expressed NPRL2, 
NPRL3 and DEPDCS in FreeStyle 293-F cells (Fig. 1a, b). To ensure a 
stable interaction between GATOR] and the Rag GTPases, we puri- 
fied a Rag GTPase heterodimer consisting of wild-type RAGA and 


mutant RAGC(S75N) that eliminates its capacity to bind GTP but 
not GDP!”. We loaded this heterodimer with an excess amount of 
guanosine 5’-[8,\-imido]triphosphate (GppNHp; a non-hydrolysa- 
ble GTP analogue) and GDP to lock its nucleotide-binding config- 
uration to GppNHpeRAGA-RAGC(S75N)eGDP, which is the most 
favourable for interacting with GATORI. Indeed, all five subunits 
co-eluted in the same fraction after gel filtration separation (Fig. la, b 
and Extended Data Fig. 1a, b). Consistent with previous studies!7-!°, 
purified GATOR] stimulated GTP hydrolysis by the RAGA-~RAGC 
heterodimer by 14-fold, but had no effect on the complex containing 
mutant RAGA(Q66L) (Fig. 1c). 

Well-defined particles of GATOR1 (290 kD) and the GATORI- 
Rag GTPases complex (370 kD) were clearly visualized by cryo-EM 
(Extended Data Fig. 1c, d). Reference-free 2D classification revealed 
explicit structural details with views from different orientations 
(Extended Data Fig. le-g). High-resolution 3D refinements from a 
homogeneous subset of 3D classifications generated the final envelopes 
for GATORI (Fig. 1d) and the GATORI-Rag GTPases complex (Fig. le) 
at 4.4A and 4.0A resolutions (gold-standard criteria, Fig. 1f). 

Despite the lack of homologous structures for use as references, the 
electron microscopy density maps enabled direct tracing of backbones 
and registering of bulky residues, and thus the building of a tentative 
structural model for GATOR1 de novo. We resolved roughly 75% of 
GATORI, except for two flexible regions in DEPDCS that lack corre- 
sponding electron microscopy density (Fig. 2a, b). For the core region 
of DEPDCS, we reached near-atomic resolution where secondary 
structures and side chains were unambiguously resolved (Extended 
Data Fig. 1h-j). Within the GATORI-Rag GTPases complex, GATOR1 
adopts a similar conformation to that of free GATOR1. Because the Rag 
GTPase heterodimer shares sequence similarity with its yeast homo- 
logue, Gtrlp-Gtr2p*”?, we were able to fit a homologous model into 
the extra electron microscopy density (Fig. 2c, d). 
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Architecture of GATORI and GATORI-Rag GTPases 

The structural model reveals that the GATOR] subunits contain several 
previously unidentified domains. DEPDCS5 has five domains, which 
we named—in order from the N terminus to the C terminus—the 
N-terminal domain (NTD), structural axis for binding arrangement 
(SABA) domain, steric hindrance for enhancement of nucleotidase 
activity (SHEN) domain, DEP domain and C-terminal domain (CTD) 
(Fig. 2e and Extended Data Fig. 2a, b). Although the DEP domain 
is well-defined, to our knowledge the other four domains are here 
resolved and visualized for the first time. 

The NTD localizes to the lateral side of DEPDC5 (Extended Data 
Fig. 2b). It has two lobes, both of which consist of a 8-sheet with an 
adjacent a-helix (Extended Data Fig. 2c, d). A VAST search® for 
homologous structure models shows that lobe B shares structural simi- 
larity with the NTD of PEX1 AAA-ATPase (Extended Data Fig. 2e), 
which may serve as an adaptor for ubiquitin or UBX domains*". 
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AF Figure 1 | Structural determination of GATORI1 
- a om and the GATOR1-Rag GTPases complex. 
ae a, Gel filtration profiles for GATORI (red line) 
& and GATORI1-Rag GTPases (blue line). mAU, 
“3 0.001 milli-absorbance unit. b, Coomassie blue stained 
S SDS-PAGE analysis of purified GATORI (red) 


GAP activity of 


0.0001 ; 7 and GATORI1-Rag GTPases (blue). Asterisk 
CNOA denotes a non-specific contamination that co- 
RAGA WT  Q66L 


purifies with GATORI. ¢, In vitro GAP activity of 
} purified GATORI. WT, wild type; kotsa, observed 
rate constant. d, e, Envelopes of GATORI (d) 
and GATORI1-Rag GTPases (e) from the 3D 
reconstructions with density shown at 0.05 
threshold level (UCSF Chimera). Scale bars, 
2nm. f, Gold-standard Fourier shell correlation 
(FCS) for GATOR] (red line) and the GATORI1- 
Rag GTPases (blue line). Data in a—c are 
representative of two independent experiments. 
See Supplementary Table 1 for cryo-EM data 
collection and refinement. 


The SABA domain (residues 168-427, previously annotated as 
DUF3608, domain of unknown function 3608) immediately follows 
the NTD of DEPDCS (Fig. 3a). It has a globular shape and shares 
topological similarities with the NADP domain of flavodoxin reduc- 
tase (NDER)*’ and the CD1 1a I-domain* (CD11I, Extended Data 
Fig. 2f-h), both of which contain ligand-binding motifs, NDFR for 
flavodoxin and CD 111 for manganese(II). The SABA domain consists 
of six B-strands (851-854, 8S6, 8S9) that form a platform surrounded 
by four a-helices (aS1-aS4), two on each side (Fig. 3a). It is conserved 
at the sequence level in Iml1p”**4, the yeast homologue of DEPDCS, 
and organizes the assembly of GATORI by mediating interactions with 
the NPRL2-NPRL3 heterodimer (see below). 

The SHEN domain (residues 720-1,010) connects to the SABA 
domain through a loop. Four ($-strands construct its base, and two 
a-helices cover one side of the sheet (Fig. 3a). The SHEN domain 
uses two flexible regions (linker S and loop S) to form interdomain 


Figure 2 | Architectures of GATORI and of the 
GATORI1-Rag GTPases complex. a, c, Atomic 
models and domain assignment for GATOR1 
(a) and the GATOR1-Rag GTPases complex (c). 
b, d, Local resolution of GATOR] (b) and 

the GATOR1-Rag GTPases complex (d). 

e, Domain organization and interaction map 

for the GATORI-Rag GTPases complex. Grey 
bars indicate domain—domain interactions. 
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Figure 3 | Domain structures within the GATOR1-Rag GTPases 
complex. a, Structures for the SABA and SHEN domains of DEPDC5. 
Interdomain contacts are illustrated with green lines. b, Structure and 
topological diagram for the TINI domain of NPRL2. Nt, N terminus; Ct, 
C terminus. c. Structure of the RAGA-~RAGC(S75N) heterodimer. NBD, 
nucleotide-binding domain. CRD, C-terminal roadblock domain. 


contacts. Linker S contains a 3-strand (8H1) and an a-helix (aH1). 
Notably, 8H1 forms a continuous sheet with the 6-strands in the NTD, 
inserting itself at the interface between the NTD and the SABA domain 
(Extended Data Fig. 3c). Loop S resides between aH2 and 3H3 and 
directly contacts the SABA domain near where this domain binds with 
NPRL2-NPRL3, which could potentially mediate interdomain commu- 
nication (Extended Data Fig. 3d, e). A 8-strand (8H2), which we named 
the ‘critical strip, contacts the nucleotide-binding domain of RAGA 
(Extended Data Fig. 3f, g). This interaction has a unique function and 
is indispensable for normal cellular response to amino acids, and thus 
differentiates the GATOR1-Rag GTPases from other GAP-GTPase 
pairs (see below). 

The CTD (residues 1,291-1,603) of DEPDC5 contains two structurally 
similar lobes and has a pseudo-2-fold rotational symmetry (Extended 
Data Fig. 4a—c). Each half consists of a five-stranded 3-sheet, with an 
a-helix covering one side. The CTD is located in the core of DEPDC5 
and contacts all the other domains of DEPDC5 except the NTD, making 
it the central organizer of this multi-domain protein (Fig. 3a). 

NPRL2 and NPRL3 have similar domain organizations (Extended 
Data Fig. 5a, f). Both contain an N-terminal longin domain (NLD, 
Extended Data Fig. 5b, g) that heterodimerizes (Extended Data Fig. 5k). 
After the NLD, a small domain bridges the longin domain to the 
C-terminal domains (Fig. 3b and Extended Data Fig. 5a). For NPRL2, 
this domain also mediates partial interactions with the SABA domain 
of DEPDCS and we therefore renamed it the TINI domain (tiny inter- 
mediary of NPRL2 that interacts (with DEPDC5)). Besides the longin 
domain interactions, the C-terminal domains of NPRL2 and NPRL3 
form a vast contact surface between each other that further reinforces 
their interaction (Extended Data Fig. 5k-m). 

The Rag GTPase heterodimer shares a similar architecture with 
Gtrlp-Gtr2p (Fig. 3c). The N-terminal regions of RAGA and RAGC 
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contain the guanine-nucleotide binding domains (NBDs, Extended 
Data Fig. 6a). Within the nucleotide-binding pocket of RAGA we can 
clearly observe extra electron microscopy density corresponding to 
GppNHp (Extended Data Fig. 6b). The nucleotide-binding pocket of 
RAGC lacks sufficient resolution to identify the ligand that is bound, 
which is thought to be GDP. RAGA and RAGC heterodimerize via 
their C-terminal roadblock domains (CRD, Fig. 3c and Extended Data 
Fig. 6c), as has also been observed in other mTORC1 pathway compo- 
nents, such as the p14-MP1 heterodimer’. Globally, the nucleotide- 
binding domains of RAGA and RAGC(S75N) are rotated substantially 
farther away from one another than seen in the open state of Gtrlp- 
Gtr2p (Extended Data Fig. 6d)”®°, suggesting that regulation of this 
GTPase heterodimer might have diverged during evolution. 

The structural model also revealed the interactions between the sub- 
units. DEPDC5 directly contacts RAGA and NPRL2, NPRL3 is bound 
to NPRL2 and RAGC to RAGA. Co-immunoprecipitation experiments 
validated these conclusions: in the absence of other GATOR] subunits, 
DEPDCS5 can interact with NPRL2 and the Rag GTPases, and DEPDC5 
co-immunoprecipitated NPRL3 to a much greater extent when NPRL2 
was also co-expressed (Fig. 2f, g). 

To identify the subunits of GATORI needed for it to associate with 
its known partners, we determined the capacity of GATOR] subunits 
to co-immunoprecipitate endogenous GATOR2"’, KICSTOR*%, and 
SAMTOR*’. Overexpressing DEPDCS5 alone is sufficient to bind to 
KICSTOR and SAMTOR (Extended Data Fig. 7a), and NPRL3 is neces- 
sary and sufficient for the interaction with GATOR2 (Extended Data 
Fig. 7a, b). Because SAMTOR is a sensor for S-adenosylmethionine, 
and GATOR? binds the leucine and arginine sensors, these results sug- 
gest that the nutrient availability is transmitted to GATOR through 
various interfaces (Extended Data Fig. 7c). 


An intact GATOR1 is required for its GAP function 
DEPDCS interacts with NPRL2 through the SABA domain (Fig. 4a). 
Among the large number of residues at the contact surface (Extended 
Data Fig. 8a—d), we observed three loops on the tip of the SABA domain 
that directly contact NPRL2, which we defined as loops A (8S1-aS1, red 
in Fig. 4a), B (8S4-GS5, orange in Fig. 4a), and C (8S9-C terminal, blue in 
Fig. 4a). Specifically, loop A contacts a unique hairpin motif (Extended 
Data Fig. 5c) attached to the longin domain of NPRL2, whereas loop 
B and loop C contact the TINI domain of NPRL2 (Extended Data 
Fig. 8e-g). To investigate the roles of these contacts in mediating the 
DEPDC5-NPRL2 interaction, we generated DEPDC5 mutants in which 
these loops were mutated to flexible Gly-Ser(GS) linkers of the same 
length. Mutants replacing any one of the three loops had only a minor 
defect in binding NPRI2Z, as they still co-immunoprecipitated NPRL2 
and NPRL3 in cells (Fig. 4b). However, we observed a strong synergistic 
effect when we replaced both loop A and loop B with GS linkers: the 
compound mutant in which both A and B loops were replaced (‘mutant 
AB)) failed to interact with any NPRL2 and NPRL3 (lane AB in Fig. 4b). 
These results suggest that loop A and B form redundant interactions 
with NPRL2 and are essential for forming an intact GATORI complex. 

We next investigated whether an intact GATOR1 is necessary for 
the appropriate regulation of mTORC1 signalling by nutrients. In 
HEK293T cells lacking DEPDC5, mT'ORC1 signalling, as detected 
by the phosphorylation of its substrate S6K1, is higher than that in 
control cells and is largely resistant to amino acid starvation (Fig. 4c). 
Expression of wild-type DEPDCS in these cells restores normal levels 
of mTORCI1 signalling as well as its sensitivity to amino acids (Fig. 4c). 
By comparison, mutant AB fails to re-sensitize the DEPDC5-null cells 
to amino acid starvation (Fig. 4c). This result suggests that an intact 
GATOR1 is necessary for suppressing mMTORCI activity under nutri- 
ent-deficient conditions. 


DEPDC5-Rag GTPases interaction is inhibitory 
The SHEN domain of DEPDCS5 directly contacts the NBD of RAGA 
(Fig. 5a). In our model, we resolve three pairs of hydrogen bonds 
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(Fig. 5b). Two of them are formed between RAGA and the backbone 
of the critical strip of DEPDC5, suggesting that the 6-strand confor- 
mation of this segment of DEPDC5 may be crucial for mediating the 
interaction. We tested this possibility by investigating how variants of 
DEPDC5 with point mutations in the critical strip (residues 770-778) 
interact with the Rag GTPases in a co-immunoprecipitation assay. 
The DEPDC5(Y775A) mutant severely disrupted the interaction of 
DEPDCS5 with the Rag GTPases (Fig. 5c and Extended Data Fig. 9a, b). 
Considering that the side chain of Tyr775 faces away from RAGA and 
that its backbone does not contact RAGA, we suspected that muta- 


tion of this residue disrupts the conformation of the entire 3-strand. 
Indeed, a much more severe mutation that we call ‘mutant P’—in 
which Tyr775—Pro779 of DEPDC5 was mutagenized to GS linkers 
(YDLLP(775-779)GSGSG)—did not further reduce the DEPDC5- 
Rag GTPases interaction compared with DEPDC5(Y775A) (Fig. 5c). 
During GTP hydrolysis, canonical GAPs insert either an arginine 
finger or an asparagine thumb into the nucleotide-binding pocket of the 
target GTPase**?. We did not observe any extra electron microscopy 
density near the nucleotide-binding domain of RAGA (Extended Data 
Fig. 9c), raising the question of whether the interaction we resolved 
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Figure 5 | The DEPDC5-Rag GTPases interaction represents an 
inhibitory state for GATORI. a. Architecture of the SHEN domain-Rag 
GTPases interactions. b, The critical strip on DEPDC5 mediates the 
interaction with RAGA. Three pairs of hydrogen bonds are shown by 
yellow dashed lines. c, Point mutations in the critical strip of DEPDC5 
impair binding of the Rag GTPase heterodimer to GATORI. P, mutant 

P. d, Single turnover stimulation assay. Scheme for single turnover GTP 
hydrolysis assay to determine the stimulatory effect of GATOR on the 
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Rag GTPases. e, f, Single turnover GTP hydrolysis stimulated by GATORI 
(e) or GATORI1 mutants (f). Dose-dependent GAP activity of wild-type 
GATORI (e) and variants that are defective in Rag GTPase binding (f). 
Representative datasets are shown here, and the statistics are summarized 
in g. g, Summary of kinetic parameters (single turnover, 25 °C) for the 
GAP activity shown in e and f. Mean + s.d. of two to three independent 
experiments is reported. For gel source data, see Supplementary Figure 1. 
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here is the one responsible for stimulating GTP hydrolysis. To test this 
possibility, we purified GATOR variants containing the DEPDC5 
mutants deficient in Rag GTPases binding and tested their GAP activity 
using a single turnover assay (Fig. 5d). Notably, these GATORI vari- 
ants have enhanced GAP kinetics compared to the wild-type complex 
(Fig. 5e-g). For example, compared to wild-type GATORI, the variant 
containing the DEPDC5(Y775A) mutant has a 20- and 15-fold increase 
in keg and K1,2, respectively (Fig. 5f, g), indicating that a weaker interac- 
tion (increased Kj/2) carries out the real GAP function (increased k-,,). 
To further confirm this result, we generated a truncated DEPDCS, con- 
sisting of only residues 1-720, that completely lacks the SHEN domain. 
This truncated version of DEPDCS still forms a complex with NPRL2- 
NPRL3 (Extended Data Fig. 9d), and has elevated hydrolysis kinetics 
similar to those of the GATOR] variant containing DEPDC5(Y775A) 
(Fig. 5f, g). 

To further validate this result, we designed a multiple turnover GTP 
hydrolysis assay (Extended Data Fig. 9e—-h), in which the excess amount 
of Rag GTPases should have the opportunity to occupy the two binding 
modes simultaneously. Wild-type GATOR1 displayed a biphasic 
behaviour in its reaction kinetics: at lower concentrations of the Rag 
GTPases, the hydrolysis rate exhibited a transient plateau (inset in 
Extended Data Fig. 9f). At higher concentrations of the Rag GTPases, 
however, we observed additional stimulation, probably because the 
increased concentration of the Rag GTPases promoted a weaker inter- 
action with a higher GAP activity (Extended Data Fig. 9f). Importantly, 
the initial phase was missing with the DEPDC5(Y775A) mutant 
(Extended Data Fig. 9g). These results suggest that the DEPDC5- 
RAGA contact detected in our structure does not execute the GAP 
activity of GATOR1, which must therefore be performed by an alter- 
native interaction. 


Two binding modes between GATOR] and Rag GTPases 
Based on the above data, we generated DEPDCS in the absence of 
the NPRL2—-NPRL3 heterodimer, and NPRL2—-NPRL3 in the absence 
of DEPDCS5, and then tested the capacity of each to GAP RAGA 
(Extended Data Fig. 9d). DEPDC5 had no activity, but NPRL2- 
NPRL3 robustly stimulated GTP hydrolysis by RAGA (Fig. 6a, b). 
Compared to intact GATOR1, a much higher concentration of 
NPRL2-NPRL3 was required to stimulate RAGA GTP hydrolysis, 
indicating that the absence of DEPDCS5 substantially reduces the bind- 
ing affinity between the Rag GTPases and NPRL2-NPRL3 (Fig. 6a, b). 
Moreover, excess NPRL2-NPRL3 stimulates hydrolysis even in the 
presence of wild-type GATORI, suggesting that DEPDC5 prevents the 
NPRL2-NPRL3 within GATORI from accessing RAGA (Extended 
Data Fig. 9i, j). These results further support the idea that a weaker 
interaction—different from the one we that we observed—carries out 
the GAP function. 

To independently confirm the binding between NPRL2-NPRL3 
and the Rag GTPases, we performed a co-immunoprecipitation assay 
in cells. In cells lacking DEPDC5, the NPRL2-NPRL3 heterodimer 
co-immunoprecipitates the Rag GTPases (Fig. 6c). The interaction was 
enhanced by the presence of the RAGA(Q66L) mutant that prevents 
GTP hydrolysis (Fig. 6c, RAGA*GTP form), as well as by the presence 
of DEPDCS that permits formation of the inhibitory binding mode 
(Extended Data Fig. 10a). 

We further reasoned that if NPRL2-NPRL3 is the unit that acts as a 
GAP and is the receiver for amino acid signals (Extended Data Fig. 7), 
amino acid availability should regulate the interaction between 
NPRL2-NPRL3 and the Rag GTPases. To test this hypothesis directly, 
we pulled down NPRL2-NPRL3 in cells lacking DEPDCS, and probed 
for the Rag GTPases that co-immunoprecipitate with NPRL2-NPRL3 
in the presence or absence of amino acids. Higher amounts of the Rag 
GTPases associated with NPRL2-NPRL3 in nutrient-deprived condi- 
tions (Extended Data Fig. 10b), but not in cells lacking GATOR2, which 
probably conveys amino acid signals to GATOR (Extended Data 
Fig. 10c). These results suggest that amino acid signals are transmitted 
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Figure 6 | A two-state model of the GATORI function. a, b, Single 
turnover GTP hydrolysis gives dose-dependent GAP activity of GATOR] 
subunits DEPDC5 and NPRL2-NPRL3 (a), with quantification of rate 
enhancement by GATORI subunits (b). A representative dataset is 
shown in a, and the statistics are summarized in b. Mean +s.d. of three 
independent experiments is shown. c, Interaction between NPRL2-NPRL3 
and the Rag GTPases in the absence of DEPDCS. W, wild-type RAGA 

or RAGC; T, RAGA(Q66L) mutant. d, A two-state model showing the 
equilibrium between GATORI and the Rag GTPases. Both the inhibitory 
mode and the GAP mode are required for regulating mTORCI1 activity. 

e, Expression of a DEPDC5 mutant that is defective in Rag GTPases 
binding further suppresses mTORCI activity in DEPDC5-null cells. 

AB, mutant AB. Data in c and e are representative of two independent 
experiments. For gel source data, see Supplementary Figure 1. 


through GATOR2 to NPRL2-NPRL3 to directly regulate the Rag 
GTPases. 

These results led us to conclude that at least two interaction modes 
must exist between the Rag GTPases and GATORI (Fig. 6d): an 
inhibitory mode characterized by a strong binding affinity between the 
Rag GTPases and the DEPDC5 SHEN domain, but a low GAP activity; 
and an alternative ‘GAP mode with the opposite characteristics. This 
proposal raised the question of the biological relevance of the inhibitory 
mode captured by our structure, as no similar behaviour has been pre- 
viously observed for a GAP. To probe this question, we tested the effects 
on mTORCI signalling of expressing DEPDC5 mutants deficient in Rag 
GTPases binding. We reasoned that if, as detected in vitro (Fig. 5f, g), 
the inhibitory mode suppresses the GAP activity of GATOR1 in cells, 
we should observe lower mTORC1 signalling (that is, enhanced GAP 
activity) when we eliminate it. This is indeed the case: in cells express- 
ing mutant P of DEPDC5, mTORC1 signalling was more suppressed 
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than in those expressing wild-type DEPDCS5 even under nutrient-rich 
conditions (Fig. 6e). Moreover, this increased degree of inhibition 
requires NPRL2-NPRL3, as we saw no difference between mutant 
P and wild-type DEPDCS in cells lacking NPRL2 (Extended Data 
Fig. 10d), which further supports the notion that the NPRL2-NPRL3 
heterodimer carries out the GAP activity of GATORI. We therefore 
conclude that the inhibitory mode between GATORI and the Rag 
GTPases operates within cells and serves to prevent GATORI hyper- 
activation to maintain the proper response of mTORCI to nutrients. 


Summary 

In this study we present cryo-EM structures for GATOR1 and the 
GATORI-Rag GTPases complex. Our work suggests that at least two 
binding modes exist between GATOR] and the Rag GTPases and that 
both are required for mTORCI signalling to respond normally to nutri- 
ents. The inhibitory mode we have identified distinguishes GATOR1 
from canonical GAPs and represents an unforeseen mechanism for how 
cells suppress mMTORC1 activity under nutrient-deficient conditions. 


Data availability Atomic coordinates and structure factors have been deposited 
in the RCSB Protein Data Bank (PDB) with accession numbers 6CET for GATOR1 
and 6CES for GATOR1-Rag GTPases. Electron density maps have been deposited in 
Electron Microscopy Data Bank with accession numbers EMD-7465 for GATOR1 and 
EMD-7464 for GATOR1-Rag GTPases. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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Extended Data Figure 1 | Structural determination and model building 
for the GATOR1 and GATOR1-Rag GTPases complex. a, Gel filtration 
profiles for GATORI (red line) and GATOR1 + RAGA(T21N)-RAGC 
(orange line). The peak position for GATOR1 does not shift upon 
incubation with RAGA(T21N)-RAGC, which suggests that there is no 
direct binding between the two complexes. b, Coomassie blue stained 
SDS-PAGE analysis of the two peaks on the GATORI1 + RAGA(T21N)- 
RAGC elution profile. No co-elution is observed. Asterisk denotes a non- 
specific band that co-purifies with GATORL. ¢, d, Raw cryo-EM images 
for GATORI (c) and the GATOR1-Rag GTPases complex (d). Discrete 
particles were clearly visualized under the microscope. Scale bars, 50 nm. 


GATOR1-RAG GTPases 2D-clustering 


GATOR1 


GATOR1 
-RAG GTPases 


DEPDC5 1244-1257 


AS 
ANZ 


Phe1251 
ye AN 


Veini2a6 PS. 
NR e258 


Xs 


k N 
VSS Phe1255 


Phe1257 eC 
ie 


e, f, 2D clustering of GATORI (e) and GATORI1-Rag GTPases (f). Yellow 
arrows in f point to the extra electron microscopy densities in comparison 
to e. * and § mark the particles shown in g. Scale bars, 5nm. g, Direct 
comparison of particles from 2D clustering of GATOR1 and GATORI1- 
Rag GTPases. Extra electron microscopy densities for the Rag GTPases can 
be directly observed. h-j, Extracted regions from the electron microscopy 
density maps and the fitted structures for a-helical (h), 3-strand (i) and 
loop (j) regions of DEPDCS5. Secondary structures and bulky side chains 
can be unambiguously resolved at the current resolution. Data in a and b 
are representative of two independent experiments. 
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Extended Data Figure 2 | Architecture of DEPDC5. a, b, Two views 

of DEPDCS. The protein backbone is depicted in rainbow colours from 

the N terminus (blue) to the C terminus (red). Binding sites for NPRL2- 
NPRL3 and the Rag GTPases are marked. c, d, Structural model (c) and 

topological diagram (d) for DEPDC5 NTD. e, Lobe B of DEPDC5 NTD 

shares structural similarity to the NTD of the PEX1 AAA-ATPase. 

f-i, The SABA domain of DEPDC5 (f) shares topological similarity (g) 


PEX1 (partial) 
PDB:1WLF 


Flavodoxin 


pevocon Mn?* binding site 
binding site 


Flavodoxin Reductase CD11a |-domain 
PDB:1FDR PDB:1LFA 


to flavodoxin reductase (h) and CD11aI domain (i), which all contain 
ligand-binding sites (indicated by arrows). The SABA domain contains a 
6-sheet insertion formed by three strands. The three loops in the SABA 
domain of DEPDCS that mediate the DEPDC5-NPRL2 interaction are 
coloured in red (loop A), orange (loop B), and blue (loop C), respectively, 
on the topological diagram. Nt, N terminus; Ct, C terminus. 
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Extended Data Figure 3 | Architecture of the SHEN domain of e, Loop S (purple) mediates interdomain contact with the SABA domain 
DEPDGCS. a, Electron microscopy density map and structural model for of DEPDC5, close to where the NPRL2-NPRL3 dimer binds to DEPDC5. 
the SHEN domain. b, Topological diagram for the SHEN domain. c, BH1 f, 8H2 (which we named the critical strip) of the SHEN domain directly 
on linker S forms a continuous sheet with the B-strands on lobe B of contacts RAGA (pink). g, Electron microscopy density map and the atomic 
the NTD, and positions itself between the NTD and the SABA domain. model for the critical strip. 


d, Electron microscopy density map and structural model for loop S. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


DEPDc5c"? 


90° - | 


1g0°<> 


Lobe 2 Lobe 1 


Extended Data Figure 4 | Architecture of the CTD of DEPDC5. a, b, Structure (a) and topological diagram (b) for the CTD of DEPDC5. C, The CTD 
of DEPDCS5 shows a pseudo two-fold rotational symmetry. Two lobes with similar folds can be seen. 
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Extended Data Figure 5 | Architecture of NPRL2 and NPRL3. 
a, Structural model of NPRL2. Contact surfaces with DEPDC5 and NPRL3 
are indicated by arrows. A long linker connects the longin domain and 
the TINI domain, with electron microscopy density shown as mesh. The 
atomic model for this linker is shown in d. b, Longin domain of NPRL2. 
A standard longin domain from LST4 is shown for comparison. ¢, A strand- 
turn-strand motif (hairpin) is attached to the longin domain of NPRL2, 
which mediates partial interaction with DEPDCS. d, Electron microscopy 


density map and atomic model for the linker connecting the longin 
domain and the TINI domain (see the electron microscopy density in a). 
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e, Structural model for the CTD of NPRL2. f, Structural model for NPRL3. 
Contact surfaces with NPRL2 are indicated by arrows. g, Longin domain 
of NPRL3 and its overlap with the longin domain of NPRL2. h, Structural 
model for the TINI domain of NPRL3 that connects its longin domain 
with the CTDs. i, Structural model for the intermediary (INT) domain 

of NPRL3. j, Structural model for the CTD of NPRL3. k-m, Interactions 
between NPRL2 and NPRL3. Three contact surfaces were identified that 
mediate the interactions between NPRL2 and NPRL3: the longin domains 
of NPRL2 and NPRL3 (k), the TINI domain of NPRL2 and CTD of NPRL3 
(1), and the CTD of NPRL2 and the INT domain of NPRL3 (m). 
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Extended Data Figure 6 | Architecture of the Rag GTPase heterodimer. 
a, NBDs of RAGA (pink) and RAGC (cyan) overlap with those of Gtrlp 
and Gtr2p (grey). b, Extra electron microscopy density can be observed 
in the nucleotide-binding pocket of RAGA, into which GppNHp can be 
fitted. c, The CRD of RAGA and RAGC tightly dimerize with one another. 
The dimerized roadblock domains from Gtrlp-Gtr2p and p14—MP1 
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are shown for comparison. d, Global conformation of the Rag GTPase 
heterodimer in comparison to the two crystal structures of Gtrlp-Gtr2p. 
RAGA and Gtr1p are aligned. Rotational movement of the NBD of RAGC 
is illustrated, and compared with the NBD of Gtr2p in the direction of 
aN5. The NBDs of the Rag GTPases rotate further away from one another, 
even when compared with the open conformation of Gtrlp—Gtr2p (top). 
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Extended Data Figure 7 | GATOR1 orchestrates amino acid signalling 
on the lysosomal surface. a, Co-immunoprecipitation of GATOR2, 
KICSTOR and SAMTOR components by overexpressed GATOR] in 
HEK293T cells. DEPDCS5 by itself is sufficient to pull down endogenous 
KICSTOR components and SAMTOR. NPRL3 is necessary to pull down 
GATOR2 components. b, Co-immunoprecipitation of endogenous 


GATOR2 components by overexpressed NPRL2-NPRL3 in cells lacking 
DEPDC5. The NPRL2-NPRL3 dimer is sufficient to pull down GATOR2, 
and additional DEPDCS5 causes no further effect. c, Signals from amino 
acids and metabolites are transmitted to GATORI through various routes. 
Data in a and b are representative of two independent experiments. 
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Extended Data Figure 8 | Interactions between DEPDC5 and NPRL2. participate in mediating interactions between the two proteins, identified 
a, b, Large contact surfaces between DEPDC5 (green) and NPRL2 by ‘InterfaceResidue’ script in Pymol. e-g, Loops A (e), B (f) and C (g) on 
(yellow) are observed from the electron microscopy density map and DEPDCS directly contact NPRL2. 


structural models. c, d, Surface residues on NPRL2 (c) and DEPDC5 (d) 
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Extended Data Figure 9 | See next page for caption. 
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Extended Data Figure 9 | In vitro characterization of the GAP 
mechanism of GATORI. a, Gel filtration profiles for DEPDC5(Y775A)- 
NPRL2-NPRL3 (blue line) and DEPDC5(Y775A)—-NPRL2- 

NPRL3 + RAGA-RAGC(S75N) in the absence (orange line) or presence 
(red line) of the crosslinker glutaraldehyde. Peak A denotes the species 
eluted at the large molecular weight region. b, Coomassie blue stained 
SDS-PAGE analysis of peak A. Direct binding is only observed in the 
presence of glutaraldehyde. Asterisk denotes a non-specific band that 
co-purifies with GATORI. c, No extra electron microscopy density can 
be observed near the NBD of RAGA. d, GATOR! variants visualized by 
SDS-PAGE followed by Coomassie blue staining. Asterisk denotes a non- 
specific band that co-purifies with GATOR1. e, Scheme for measuring 
stimulated GTP hydrolysis by GATOR] in a multiple turnover setup. An 
excess amount of Rag GTPases singly loaded with GTP was incubated 
with fixed amount of GATORI. The hydrolysis reaction was traced and 
quantified. f, Stimulated GTP hydrolysis by wild-type GATOR1 shows a 
biphasic behaviour in reaction kinetics. As an increasing amount of the 
Rag GTPases was included in the reaction, a small plateau of observed 
rate constant (Kobsq) was first observed at a lower concentration (inset). 
Such biphasic behaviour indicates that two binding modes exist in the 
wild-type GATORI: one with higher affinity to the Rag GTPases but 
lower GAP activity, the other with lower affinity but higher GAP activity. 


ARTICLE 


A representative dataset is shown in this panel, and the statistics are 
summarized below. g, Stimulated GTP hydrolysis bya GATORI mutant 
that is defective in Rag GTPases binding eliminates the initial phase. 
DEPDC5(Y775A)—NPRL2-NPRL3 is defective in stable Rag GTPases 
binding because it lacks the docking site (an intact critical strip) for 

the Rag GTPases. Consequentially, the inhibitory mode diminishes 
(inset), leaving a single phase corresponding to the GAP mode in 
reaction kinetics. A representative dataset is shown, and the statistics are 
summarized below. h, Summary of kinetic parameters for the multiple 
turnover GAP activity shown in f and g. Mean + s.d. of two to three 
independent experiments is reported. i, A stimulated-chase assay to 
characterize the inhibition mechanism of DEPDCS5. Wild-type GATOR1 
was first added to bind the Rag GTPases with its inhibitory mode. Extra 
NPRL2-NPRL3 was then included in the reaction as a chase. We reasoned 
that if DEPDC5 sequesters the NBD of RAGA, no further stimulation 
should be observed; if DEPDC5 simply prevents NPRL2-NPRL3 from 
accessing the NBD of RAGA, we should observe additional stimulation 
because there is no DEPDCS to inhibit the extra NPRL2-NPRL3. j, 
Further stimulation is observed in the presence of additional NPRL2- 
NPRL3, as reflected by the faster hydrolysis rate (steeper slope), suggesting 
DEPDCS inhibits NPRL2-NPRL3 in cis. Data in a, b, d and j are 
representative of two independent experiments. 
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Extended Data Figure 10 | In vivo characterization of the GAP immunoprecipitates with NPRL2-NPRL3 in the absence of amino acids. 
mechanism of GATORI. a, Interaction between NPRL2-NPRL3 and c, Loss of regulated interaction between NPRL2-NPRL3 and the Rag 
the Rag GTPases is enhanced by wild-type DEPDCS5 but not mutant P, GTPases in cells lacking DEPDC5 and MIOS. No difference is observed 
which is defective in binding to the Rag GTPases. W, wild-type DEPDC5; when GATOR2, the receptor for amino acid signals, is knocked out. 
P, mutant P. Asterisk denotes a non-specific band. b, Amino acid d, Expression of a Dedpc5 mutant that is defective in Rag GTPases binding 
availability regulates the interaction between NPRL2-NPRL3 and the Rag _has no effect in NPRL2-null cells, in sharp contrast to the result in Fig. 6e. 
GTPases in cells lacking DEPDC5. Higher amount of Rag GTPases co- Data in a-d are representative of two independent experiments. 
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A density cusp of quiescent X-ray binaries in the 
central parsec of the Galaxy 


Charles J. Hailey!, Kaya Mori', Franz E. Bauer?*+, Michael E. Berkowitz!, Jaesub Hong® & Benjamin J. Hord! 


The existence of a ‘density cusp’!*—a localized increase in 
number—of stellar-mass black holes near a supermassive black 
hole is a fundamental prediction of galactic stellar dynamics*. The 
best place to detect such a cusp is in the Galactic Centre, where 
the nearest supermassive black hole, Sagittarius A*, resides. As 
many as 20,000 black holes are predicted to settle into the central 
parsec of the Galaxy as a result of dynamical friction**; however, 
so far no density cusp of black holes has been detected. Low-mass 
X-ray binary systems that contain a stellar-mass black hole are 
natural tracers of isolated black holes. Here we report observations 
of a dozen quiescent X-ray binaries in a density cusp within one 
parsec of Sagittarius A*. The lower-energy emission spectra that 
we observed in these binaries is distinct from the higher-energy 
spectra associated with the population of accreting white dwarfs that 
dominates the central eight parsecs of the Galaxy®. The properties 
of these X-ray binaries, in particular their spatial distribution and 
luminosity function, suggest the existence of hundreds of binary 
systems in the central parsec of the Galaxy and many more isolated 
black holes. We cannot rule out a contribution to the observed 
emission from a population (of up to about one-half the number of 
X-ray binaries) of rotationally powered, millisecond pulsars. The 
spatial distribution of the binary systems is a relic of their formation 
history, either in the stellar disk around Sagittarius A* (ref. 7) or 
through in-fall from globular clusters, and constrains the number 
density of sources in the modelling of gravitational waves from 
massive stellar remnants®’, such as neutron stars and black holes. 
The Chandra X-ray Observatory has accumulated 1.4 x 10° s 
of observations of the Galactic Centre using the Advanced CCD 
Imaging Spectrometer I (ACIS-I) over the past 12 years. Owing to the 
high concentration of X-ray sources in the Galactic Centre (such as 
Sagittarius A* (hereafter Sgr A*) and IRS 13) and the emission from 
hot gas, we restrict our analysis to angular distances of more than 
5" from Sgr A*, which correspond to projected distances r of more than 
0.2 pc assuming an 8-kpc distance to the Galactic Centre'®. From these 
Chandra observations, 415 X-ray point sources at projected distances of 
0.2 pe<r<3.8 pc from Sgr A* were identified in the 2-8-keV energy 
band, and used for all analysis. Point sources were ignored if they were 
in diffuse regions such as filamentary structures or molecular clouds, 
were too close to very bright point sources or were previously known 
to have large X-ray outbursts. A net count (source minus background 
counts) limit of C> 100 (equivalent to a flux of 2 x 107 erg cm~* s~1) 
was used for the analysis. This count limit corresponds to a point source 
detection significance of 40. There are 92 sources with C > 100, 26 of 
which lie at r< 1 pc. For spectral fitting, a limit of C> 200 was used 
in order to constrain spectral parameters effectively: 13 sources with 
r<1pc met this criterion. This is too few sources for population 
analysis, so we instead use a hardness ratio of X-ray colour, or slope, 
HR2= (Cy — C,)/(Cy + C_), where Cy and Cy are, respectively, the 
net counts from the source in the 2-4-keV (low) and 4-8-keV (high) 


energy bands. Previous Chandra analysis'! and simulations reported 
here demonstrate that the thermal emission of magnetic cataclysmic 
variable stars—which have typical temperatures of kT ~ 8-40 keV—in 
the Chandra energy band of 0.5-8 keV is well modelled by a single- 
temperature, optically thin thermal plasma and an iron line 
complex at an energy of E=6-7 keV. By contrast, black-hole and 
neutron-star binaries and pulsars are universally characterized by 
non-thermal power-law emission with a flux described by F(E) = kET 
photons cm~? s~! keV~1, where E is the photon energy in keV and 
['~1.5-2 (ref. 12) is the photon index. The low- and high-energy bands 
for HR2 were chosen to maximally separate thermal and non-thermal 
sources. Thermal sources have HR2 values of more than 0.3, whereas 
non-thermal sources have HR2 values of less than 0.3. 

In Fig. 1a, b we show the HR2 distributions for sources with C> 100 
that lie in the circle r< 1 pe and in the annulus 1 pc<r<3.8 pc, respec- 
tively, centred on Sgr A*. A second source population clearly appears 
at r<1 pc in the range —0.1 <HR2 < 0.3. Integrating the HR2 distri- 
bution in the annular region for HR2 < 0.3, we would expect at most 
one source with HR2 <0.3 for r< 1 pc ifthe sources in the inner region 
are drawn from the same HR2 distribution as that for the sources in the 
annulus; however, there are 12 such sources. 

The diffuse, hard X-ray emission in the Galactic Centre, which 
dominates the inner 8 pc of the Galaxy’®, is due to unresolved magnetic 
cataclysmic variable stars (in particular, a sub-class known as interme- 
diate polars) with optically thin thermal emission of kT + 8-40 keV 
(ref. 13). These intermediate polars are the origin of the HR2 >0.3 
population. The newly identified HR2 < 0.3 population is prominent 
only inside the central parsec. 

Before concluding that the HR2 probability distribution in the inner 
parsec is due to a newly identified source population, we studied the 
confounding effects of the high concentration of gas and dust in the 
Galactic Centre. The high dust column density could preferentially 
scatter soft X-rays or lead to the well-known degeneracy between 
the spectral hardness and interstellar absorption of a source’. Both 
of these effects could alter the HR2 distribution near Sgr A*. They 
can be probed using a second, lower-energy-band hardness ratio to 
produce colour-colour plots, so-called quantile diagrams, and spectral 
simulations of thermal and non-thermal sources. All of these analyses 
demonstrate that the HR2 < 0.3 sources cannot be HR2 > 0.3 sources 
masquerading as apparent HR2 < 0.3 sources as a result of scattering 
or column-density effects (see Methods). 

The stacked spectra of the HR2 < 0.3 and HR2 > 0.3 sources in the 
inner parsec are compared in Fig. 2. The comparison provides further 
confirmation of the spectrally distinct nature of the HR2 < 0.3 sources 
in the inner parsec. The HR2 > 0.3 sources are well fitted by a thermal 
spectrum with partially covered absorption to account for an accretion 
curtain and scattering from the surface of a white dwarf—a model that 
is commonly applied to intermediate polars. The best-fitting 
temperature (kT = 6.3} keV, where the errors here and elsewhere 
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Figure 1 | Hardness ratio (HR2) distribution of X-ray point sources 
with net counts of C > 100 at a projected radial distance from Sgr A*. 
a, Sources for r< 1 pc. b, Sources for 1 pc <r <3.8 pe. A substantial 
population of sources with —0.1 <HR2< 0.3 appears for r< 1 pce (a). Of 
the 66 sources in the annulus (b), 2 (3%) have HR2 < 0.3, whereas of the 26 
sources in the inner circle (a), 12 (46%) have HR2 < 0.3. A Kolmogorov- 
Smirnov test shows that the HR2 cumulative distribution function of the 
annular region is not consistent with that of the circular region (D = 0.528; 
P=3.0 x 1075). The two outliers in b, at HR2 = —0.56 and HR2 = —0.74, 
were included in the Kolmogorov-Smirnov test. They account for at 

most one HR2 < 0.3 source in a, with a probability of about 2%. A 7 test 
based on the lack of spatial constancy in HR2 between 1 pe<r< 3.8 pe 
and r< 1 pcalso indicates a different source population for r< 1 pc 
(P=1.1 x 107'%). The Kolmogorov-Smirnov test, outliers and the ” test 
(Extended Data Fig. 3) are further discussed in Methods. 


correspond to one standard deviation) is consistent not only with the 
temperatures of the HR2 > 0.3 sources at 1 pc <<r< 3.8 pc, but also 
more generally with the low-temperature component of the (unre- 
solved) hard X-ray emission and the typical Chandra-measured 
temperature found for magnetic cataclysmic variable stars in the 
Galactic Centre). By contrast, the stacked spectrum of the HR2 < 0.3 
sources in the inner parsec is well fitted (with a reduced chi-squared of 
x7, =0.81 for 51 degrees of freedom (d.o.f.)) by a non-thermal 
power-law with = 1.9*}'3, with no sign of neutral or ionized iron lines 
in the spectrum. A final confirmation of the newly identified, spectrally 
distinct source population in the inner approximately 1 pc comes from 


| 
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the 13 sources in the central parsec that are bright enough (C > 200) to 
enable spectral fitting. The HR2 < 0.3 sources are well fitted with a 
non-thermal spectrum and the HR2 > 0.3 sources with an intermediate- 
polar thermal model (see Methods for discussion and Extended Data 
Fig. 1 for examples). 

The spatially distinct morphology of this newly identified source 
population is shown in Fig. 3, with the HR2 < 0.3 sources concentrated 
in the central parsec and the HR2 > 0.3 sources extended over the 
region of the diffuse, hard X-ray emission. The newly identified popu- 
lation is not extragalactic because the number of background active 
galactic nuclei (about 0.1)’* is negligible within the inner approximately 
1 pc above the 2-8-keV flux threshold. The non-thermal spectrum, 
with [’~ 1.5-2 and lacking iron lines, clearly distinguishes this popu- 
lation from the population of thermally emitting intermediate polars 
that dominate at larger radii. We rule out neutron-star, low-mass X-ray 
binaries as candidates for the newly identified population because of 
the short recurrence time of about 5-10 years between their large X-ray 
outbursts. The continuous monitoring of the Galactic Centre for more 
than a decade is believed to have revealed all such binaries!”!8, and 
none of the point sources discussed here has ever produced an outburst. 
High-mass X-ray binaries comprise about 40% of all Galactic X-ray 
binaries’” and, given the large concentration of O and B stars in the cen- 
tral parsec!’, are potential contributors to the newly identified source 
population. However, infrared surveys that cover all of the Chandra 
X-ray sources in the Galactic Centre are able to rule out massive stellar 
companions”*”!, from bright O or B supergiants down to the faintest of 
the Be high-mass X-ray binaries (spectral class B2V), effectively elimi- 
nating the possibility of a contribution of high-mass X-ray binaries. 
Coronally active isolated stars and stellar binaries having soft thermal 
spectra (corresponding to HR2 < —0.3) are not observed among our 
sources in the central approximately 1 pc; in addition, their expected 
luminosities are well below the Chandra detection threshold. During an 
outburst they may have harder X-ray spectra, but would still be below 
the Chandra detection threshold, and their numbers are substantial 
only in the inner approximately 0.1 pc (ref. 22). 

The most plausible explanation for the non-thermal sources in the 
inner parsec is that they are quiescent black-hole low-mass X-ray 
binaries (qBH-LMXBs). Although qBH-LMXBs can exhibit modest 
variability”’, there is limited information about their long-term var- 
iability, so steady emission from qBH-LMXBs cannot be ruled out. 
Timing analysis of the non-thermal sources in the inner parsec 
over the 12 years of observations shows that 6 out of 12 of them are 
variable, whereas the remaining 6 are steady. Rotation-powered 
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Figure 2 | Stacked Chandra spectra for X-ray sources within the inner 
parsec. a, Sources with HR2 > 0.3. b, Sources with HR2 < 0.3. The 

HR2 > 0.3 spectrum (a) for the 8 sources with strong Fe lines is well 

fitted by an optically thin, thermal plasma model with temperature 
kT=6.37}'3 keV O? = 1.25 for 36 d.o.f.). A power-law model yields a poor 
fit (x? = 1.90 for 39 d.o.f.; P= 0.7702 ). The 6 sources with weak or no Fe 
lines have a comparable best-fit photon index of P= 0.67)'t). These 
photon indices are typical of intermediate-polar spectra fitted with a 
power-law model in the Chandra energy bandpass!!3, The HR2 < 0.3 
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spectrum (b) is well fitted by a power-law model with a significantly softer 
photon index ("= 1.9*)'3 for 51 d.o.f.). Stacked spectra of the 14 

HR2 > 0.3 sources (Extended Data Fig. 7) were fitted to a thermal plasma 
model with the best-fit temperature kT = 7.31} keV x? = 0.80 for 58 
d.o.f.), the typical Chandra-measured temperature (about 7-9 keV) for 
magnetic cataclysmic variables in the Galactic Centre®. The error bars 
represent lo statistical uncertainties. The bottom panels show residuals 
(data minus model) in terms of lo significance. For more details on the 
stacked spectra analysis, see Methods. 
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Figure 3 | Chandra 2-8-keV image of the Galactic Centre with X-ray 
sources with C > 100 overlaid. Thermal (HR2 > 0.3) and non-thermal 
(HR2 < 0.3) sources are indicated with red and cyan circles, respectively. 
The colour scale indicates the number of 2-8-keV counts per pixel. The 
inner and outer yellow circles delineate the 0.2 pc <r<1 pc region around 
Sgr A*. The inner region was excluded from the analysis. The magenta 
ellipse (7.8 pc x 3.9 pe, full-width at half-maximum) bounds the region of 
spatially unresolved hard X-ray emission that was discovered by NuSTAR 
and is due to thermal emission from intermediate polars. The non-thermal 
sources cluster inside the inner parsec, whereas the thermal sources are 
distributed more uniformly throughout the hard X-ray emission. The 
Galactic plane runs along the semi-major axis of the ellipse. The dearth of 
sources to the north of the Galactic plane is due to strong extinction from 
molecular clouds (such as the circumnuclear disk), which densely populate 
that region?’. This may reduce the estimated number of sources by a factor 
of a few. Some bright sources without circles are X-ray transients, including 
the outbursting neutron-star low-mass X-ray binary AX J1745—2901 in 

the lower right corner. For reference, Chandra’s angular resolution is 

0.5 arcsec, which corresponds to about 0.025 pc at the Galactic Centre. 


millisecond pulsars (rMSPs) also have non-thermal emission consistent 
with HR2 < 0.3. However, rMSPs are always steady over timescales 
of months to years”4, so we cannot rule out that up to one-half of the 
non-thermal sources that we observe in the inner parsec are rMSPs. 
In Fig. 4 we show the surface density of the 12 non-thermal sources 
as a function of projected radius from Sgr A*, from which we extract 
a three-dimensional cusp power-law index. The cusp index does not 
change in a statistically significant fashion if the six qBH-LMXB can- 
didates are analysed separately from the six rMSP candidates (see 
Methods for discussion). Owing to the complexities of the formation 
and evolution of BH-LMXBs, the agreement with the isolated black- 
hole prediction is probably coincidental. However, the concentration 
of black holes in the inner approximately 1 pc (well inside the influence 
radius at r 3 pc) that we observe has been predicted previously*. In 
Extended Data Fig. 2 we show the logN-logS (number versus X-ray 
flux) for the GBH-LMXB candidate sources with r< 1 pc and down to 
the Chandra flux limit. A power-law fit to the logN-logs distribution 
(N(>S)=kS~°, a= 1.870%) can then be extrapolated to the minimum 
flux of the local, known BH-LMXB population (see Methods) to give 
600-1,000 qBH-LMXBs (lo error on the fit) if all of the observed 
HR2 <0.3 sources are GBH-LMXBs, and 300-500 if one-half of them 
are rMSPs. The lack of large outbursts from individual sources over 
more than a decade suggests outburst recurrence times of about 1,000 
years, for which there is some theoretical motivation (see Methods). 
If the density cusp extends to Sgr A*, then the qBH-LMXB counts 
must be corrected for the unobserved volume inside 0.2 pc. For the best- 
fitting a and three-dimensional cusp index 4, this correction would 
increase the total number of qBH-LMXBs for r < 1 pc by about 50%. 
To estimate an upper limit for the total number of rMSPs we use a 
previous result®, obtained by using the measured correlation between 
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Figure 4 | Surface density of the 12 non-thermal sources as a function of 
projected radius from Sgr A*. Sources with C> 100 and C> 50 are 
indicated in black and red, respectively. The cusp power-law index ¥ of the 
non-thermal sources (HR2 < 0.3) was obtained by using an assumed three- 
dimensional form for the source density of n(r) = kr~” and projecting this 
along the line-of-sight radius R to obtain a best fit to the surface density 
3(R)=kR~-°. The best fit yields y= 2.479'3. The isolated black-hole cusp 
around Sgr A* is predicted to have a power-law index ¥ in the approximate 
range 1.3-2.3 (refs 26, 28-30). The red histogram is for sources with 

C> 50 and gives a crude estimate of the surface density at larger radii, 
although it may suffer from mild contamination from spurious 
background sources. The best-fitting power-law index in this case is 

y= 2.0) 5. There is no statistically significant difference in the power-law 
index if the six rMSP candidates are removed from the analysis. See 
Methods for more details of the cusp analysis. 


the spin-down power E and the X-ray luminosity Lx for a large sample 
of rMSPs, that the fraction of rMSPs above the Chandra flux threshold 
at the Galactic Centre is about 3%. We therefore set an upper limit of 
approximately 200 for the number of rMSPs in the Galactic Centre. 

Our estimated number of BH-LMXBs is a lower limit for the number 
of isolated black holes in the Galactic Centre. The total number of low- 
mass X-ray binaries is consistent with a previous estimate’ based on 
three-body exchange formation in in-falling globular clusters; however, 
it has been argued” that this previous estimate is an overestimate by 
10-100. If that is the case, then it is most likely that the black-hole 
binaries were formed via the tidal capture of old, low-mass stars (either 
orbiting Sgr A* or formed in the surrounding stellar disk), and the 
observed number of BH-LMXBs necessitates the existence of more 
than about 10,000 isolated black holes (A. Generozov, B. Metzger, 
N. Stone and J. Ostriker, manuscript in preparation). 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

We used data from the Chandra X-ray Observatory (CXO). Following previous 
survey analyses’! only data from the ACIS-I instrument were used. There were 
45 observations pointing at or near Sgr A* providing a total cleaned exposure of 
1.4 Ms. This is a 40% increase in exposure time over the most recent and updated 
(hereafter Muno) source catalogue", although all of the sources we analysed are 
listed in the Muno catalogue. 1a errors are quoted for parameter uncertainties 
throughout the paper. 

Point source selection. We selected all point sources in the Muno catalogue within 
a circle of radius r= 100" (about 4 pc) centred on Sgr A*. We checked to ensure that 
these point sources were not embedded in X-ray filaments or molecular clouds. 
We excluded all known sources of transient outbursts such as neutron-star low- 
mass X-ray binaries (NS-LMXBs). The region at r< 5” (less than about 0.2 pc) 
was also excluded because of very high background contamination and owing to 
the complex X-ray morphology that enhanced difficulties with isolating the true 
properties of the point sources; this region includes Sgr A*, the IRS 13 star-forming 
complex and PWN G359.96—0.04. This left 415 point sources from the catalogue 
out to r< 100”. Sources with C> 200 (41 sources, 10%) after background subtrac- 
tion could be spectrally fitted individually. Analysis with hardness ratio (colour) 
was possible for sources with C> 50 (211 out of 415, 51%). However, most of our 
analysis was done for sources with C> 100, because it is increasing difficulty to 
distinguish between hard and soft sources as the count rate decreases. 

Source and background extraction. For each source, ACIS Extract (AE) 
(version AE 2016Sept22; ref. 31) was used to extract source photons from a 
circle containing 90% of the encircled energy fraction (EEF), typically with 
1” radius, centred on each source. All available observations from years 
2003-2014 were used. For each Chandra observation, AE provides event and 
image files for the source, spectral files including source and background spectra, 
response matrix and effective area files, and light curves. We investigated various 
methods of background extraction and adopted the following source/background 
extraction methodology as optimal (and later validated by our MARX simulations) 
in the crowded Galactic Centre region where background is not spatially uniform. 
For background subtraction, AE defines an inner annulus at 1.1 times the 99% EEF 
radius (typically 2.5”) so that source photons from the wings of the point spread 
function do not strongly contaminate background estimates. The outer radius of 
the background annulus is increased until the ratio of its area to the area of the 
source extraction region reaches an optimum preset value (= 5) as recommended 
in AE. The outer radius of the background annulus can vary over a range of about 
3.5" to 5” because as additional sources are encountered from the inputted region 
source list, AE automatically excludes those sources and subsequently increases 
the outer radius until the optimum preset value of background-to-source area ratio 
is reached. After AE processing, we further excluded point sources with very high 
background counts (usually from other nearby point sources) from our analysis. 
Extended Data Fig. 4 illustrates our source and background extractions for one of 
the 12 soft sources (source 5 in Extended Data Table 1) located in the Rockefeller 
Ridge, the most source-crowded region we analysed. 

Spectral analysis. We applied four different methods of spectral analysis: the 
Chandra-defined hardness ratio (such as X-ray colour or slope) HR2 = (Cy — C,)/ 
(Cy + Cy), where Cy and C, are the net counts in the 2-4-keV and 4-8-keV bands, 
respectively; colour-colour diagrams using HR2 and HR3 = (Cy — C,)/(Cu + Cy); 
where Cy and C, are now the net counts in the 1-3-keV and 5-8-keV bands; 
spectral fitting of individual sources, with net count C> 200; and spectral fitting 
of stacked source spectra. We performed all spectral fitting and calculated hardness 
ratios using XSPEC version 12.9°. We adopted the thabs model and the previously 
reported* abundance data to take into account interstellar absorption. In addi- 
tion, the Chandra simulation tool (MARX version 5.3.1) was used extensively**. 
MARX allows simulated sources with pre-defined spectra to be added directly 
into the multiple-observation Chandra data that we analysed, with their actual 
backgrounds, in the Galactic Centre. MARX was used to simulate non-thermal 
power-law spectra to assess efficiency for recovering the proper photon index 
and hardness ratios as a function of source brightness, in the actual Chandra 
Galactic Centre data. Finally, the standard XSPEC simulation tool fakeit was used 
to determine HR2 and HR3 for various source spectra, to investigate the effects of 
dust scattering on hardness ratios and power-law photon indices and to investigate 
any degeneracy between photon index determination and column density. 
Hardness ratio analysis. We use hardness ratio analysis to determine whether the 
point source population is composed of spectrally distinct components. Hardness 
ratios permit assessment of spectral properties of fainter sources than would be 
possible with spectral fitting. Previous work! has established that HR2 is an 
effective diagnostic for distinguishing magnetic cataclysmic variables (mCVs) 
with thermal spectra from millisecond pulsars, neutron-star binaries and black- 
hole binaries with non-thermal spectra. mCVs are known to emit as an optically 
thin plasma of approximate temperature 8-40 keV with an accompanying iron-line 


complex at E~ 6-7 keV (ref. 13). This is in contrast to the populations of millisecond 
pulsars, neutron-star binaries and black-hole binaries, which exhibit their true 
non-thermal power-law spectra with photon indices of [’~ 1-2.5 (refs 12, 23). 
On the basis of simulations of Chandra spectra, we adjusted the energy bands of 
HR2 so that the largest distinction between the thermal and non-thermal models 
could be obtained. Our resultant definition of HR2 differs slightly from that used 
previously'®. HR2 =0.3 was adopted as the dividing line between the thermally 
emitting mCVs (HR2 > 0.3) and the non-thermal source populations (HR2 < 0.3). 
Because higher HR2 corresponds to spectrally harder sources, mCVs appear harder 
in the Chandra bandpass even though they have thermal spectra. More recent 
work®!335 has shown that the mCVs in the Galactic Centre are in fact completely 
dominated by a sub-class of mCV called intermediate polars (IPs); henceforth, we 
use that designation for the thermal source population characterized by HR2 > 0.3. 
We performed XSPEC simulations to establish the conversion from photon index 
to HR2, for a range of neutral hydrogen column densities typical of the Galactic 
Centre ((6-17) x 10”? cm~?; refs 36, 37). XSPEC simulations were also used to 
determine at what HR2 typical IPs with kT’ ~ 8-40 keV would appear. For this 
simulation, we chose four canonical IPs (V709 Cas, NY Lup, V1223 Sgr and TV 
Col), representative of a range of plasma temperatures and neutral hydrogen 
column densities associated with an accretion curtain at the distance of the Galactic 
Centre (8 kpc)!°. In all cases the IPs had HR2 > 0.3. For reference, HR2 = 0.3 
corresponds to ‘= 1.5 for non-thermal spectra with a hydrogen column density of 
Nu=14 x 10” cm, the median of the range of values in the Galactic Centre. This 
Tis typical for the majority of NS- and BH-LMXBs, and rMSPs'*?38, 

The HR2 distribution for 1 pc <r< 3.8 pc has two extreme outliers (CXO 
J174540.20—285900.8 and CXO J174537.98—290134.5), at HR2 values of 
—0.7479-13 and —0.561) 9, respectively. These sources are not considered to be 
IPs or non-thermal sources (see below), but they cannot be excluded from the 
Kolmogorov-Smirnov (KS) test because they lie in a range of HR2 where, with 
their errors, they could contribute to the r< 1 pc HR2 distribution. However, 
because these are the only two outliers at HR2 < 0.3, out of the 66 sources for 
1 pc<r<3.8 pe, there can be at most 1 out of the 12 HR2 < 0.3 sources for r< 1 pe 
that arises from the 1 pc <r<3.8 pc population. This is an extremely conservative 
estimate. Using the errors on the two HR2 outliers and on all of the r< 1 pe sources, 
there is only an approximately 2% chance that even one of the HR2 < 0.3 sources 
for r< 1 pc is from the same source population(s) as the 1 pc <r < 3.8 pc sources. 

Given the results of the KS test, we performed a parametric test to confirm that 
the source population for r< 1 pc is different from that (or those) for 
lpc<r<3.8 pe. Extended Data Fig. 3 shows the radial distribution of HR2 for the 
sources with C> 100 as a function of projected distance from Sgr A*. The errors 
on the HR2 determination are normally distributed, as determined by simulations, 
so a standard y’ test can be applied to test for the constancy (expected for an IP 
population) of HR2 as a function of distance from Sgr A*. Fitting a function 
HR2(r) = constant to the data for r> 1 pc yields the best-fitting HR2 of 0.59. The 
fact that the reduced \? value 0? = 1.39 for 63 d.o.f.) exceeds 1 suggests that the 
spectral hardness data are intrinsically scattered, probably owing to the spread of 
the IP temperature distribution, which is typically in the range kT ~ 8-40 keV. We 
estimated the systematic errors associated with the intrinsic HR2 distribution of 
r> 1 pc sources by taking the Gaussian quadrature difference between the total 
variances and the statistical errors. We found that the systematic errors (about 8%) 
are subdominant compared to the statistical errors (about 20%). To test the 
hypothesis of a bifurcation in the HR2 distribution at r< 1 pc, we fitted a constant 
HR2 function to the sources for 0.2 pc< r< 3.8 pc, taking into account the 
systematic and statistical errors. The fit gives a large y” value 02 = 3.0 for 89 d.of., 
P=1.1~x 10~!), which establishes the non-constancy of HR2 and thus the 
presence of a different source population for r< 1 pc. The outliers were excluded 
in this analysis of HR2 spatial constancy because their HR2 values are more than 
50 from even the lowest HR2 (with error) from the 64 other 1 pe<r< 3.8 pce 
sources. However, consistent with the KS test, one HR2 < 0.3 source at r< 1 could 
be associated with the r>1 pc source population(s). Because the x? test is 
parametric and depends on the assumption of spatial constancy of the dominant 
IP population, it is weaker than the non-parametric KS test, which makes no 
assumptions about the spatial dependence of HR2 and includes the outliers; but it 
provides additional confirmation of the newly identified population for r<1 pc. 

The two outlier sources have sufficient counts to enable us to fit their Chandra 
spectra. Fitting with an absorbed power-law model yields very soft photon indi- 
ces (I> 4), indicating thermal X-ray spectra. The source CXO J174540.20— 
285900.8 fits well to an absorbed blackbody model with kT ~0.5 + 0.1 keV. The 
best-fitting blackbody temperature is consistent with persistent thermal emission 
of a magnetar with typical temperatures of kT ~0.3-0.5 keV (ref. 39). The other 
source (CXO J174537.98—290134.5) exhibits several emission lines in the energy 
range E~ 2-3keV and its spectrum fits to an absorbed, optically thin thermal 
plasma (tbabs*apec) model with best-fitting temperature kT = 0.8 + 0.1 keV. 
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This source is probably a dwarf nova, given that its temperature is lower than those 
of IPs and polars. The two outliers are not shown in Fig. 3 and Extended Data Fig. 3. 
Colour-colour analysis. One systematic uncertainty that must be addressed in 
both the hardness ratio analysis and the spectral analysis is the possible degeneracy 
between hydrogen column density and spectral hardness/softness. There are spatial 
variations in the hydrogen column density in the Galactic Centre. These variations 
can cause an intrinsically hard source at low Ny in the Chandra bandpass to be 
misidentified as a more highly absorbed soft source, because both may yield the 
same HR2. We investigated whether this degeneracy could lead to systematically 
lower HR2 values in the central parsec. The degeneracy would also affect the 
spectral analysis, as discussed below. To evaluate the possibility of degeneracy 
affecting our analysis, we resorted to colour-colour plots, a method that is com- 
monly used to estimate the severity of absorption effects. For this investigation 
the second colour HR3 was used, because it utilizes the 1-3-keV energy band and 
so is more sensitive to hydrogen column density than is HR2, which focuses on 
higher-energy X-rays. To assess the sensitivity of HR3 to absorption effects, the 
XSPEC routine fakeit was used to generate simulated Chandra spectra of both 
IP thermal spectra at kT = 8-40 keV and non-thermal power-law spectra with 
I’ 1-2.5. XSPEC simulations were used to calculate HR2 and HR3 for the input 
spectra. They demonstrated that an intrinsic HR2 > 0.3 source could only appear 
as an apparent HR2 < 0.3 source, owing to lower column density, if HR3 < 0.6. 
On the contrary, HR2 < 0.3 sources always have HR3 > 0.9. Extended Data Fig. 5 
shows a HR2-HR3 scatter plot of all of the sources, differentiating those for r< 1 pc 
(cyan) and those for r> 1 pc (red). With the exception of two sources (which are 
intrinsically hard sources with low Ny ~ 107? cm~?, on the basis of spectral fitting), 
all of the HR2 < 0.3 sources have high HR3, confirming that they are intrinsi- 
cally soft spectral sources rather than hard sources (intrinsic HR2 > 0.3) with low 
column density masquerading as low-HR2 soft sources. In fact, the systematics of 
the absorption are such that very high column densities would cause intrinsically 
low-HR2 sources to appear as higher-HR2 sources with high HR3. Thus the HR2 
analysis of the number of soft sources for r< 1 pc is conservative. 

Quantile hardness ratio analysis. More recently, extensive use has been made of 
quantile analysis'**> instead of the colour-colour analysis of Extended Data Fig. 5. 
Quantile plots have been shown to be more effective in breaking photon index 
(hardness ratio)-hydrogen column density degeneracies. Extended Data Fig. 6 
shows the same data as in Extended Data Fig. 5, but in a quantile plot. Here Q, is 
the energy within which n% of the photons in the 2-8-keV band are contained and 
m is the median energy below which 50% of the photons in the band are contained. 
The quantile diagram also shows contours of column density and photon index, 
along which the quantile parameters of the x and y axis have been calculated using 
an assumed non-thermal spectral shape and Chandra’s response function. The 
diagram clearly shows the two distinct populations of sources. Nearly all of the 
sources for r> 1 pc have I< 1 whereas about half of the sources for r< 1 pc have 
softer power-law photon indices of J’~ 1-2. The large scatter of all of the sources 
with column density shows no systematic bias in source hardness due to column 
density effects. 

Spectral analysis of individual, bright Chandra sources. A spectral analysis, 
albeit available for only a smaller number of sources (13 for r< 1 pc and 28 for 
1 pc<r<3.8 pc) provides a second method for confirming the presence of 
distinct source populations inside and outside 1 pc. We fitted Chandra spectra of 
the sources with sufficient net counts (C > 200) using thermal (thabs*pcfabs*apec) 
and non-thermal models (thabs*powerlaw) in XSPEC. Spectral bins were adap- 
tively re-binned so that the counts in each bin have at least 3c statistical significance 
over background. 

Outside r= 1 pc, 16 of the 28 HR2 > 0.3 sources with C> 200 are well fitted by 
a thermal model with kT + 5-10keV. These brighter sources, as demonstrated by 
the example shown in Extended Data Fig. la, clearly exhibit broad iron lines at 
6-7 keV, characteristic of mCVs. The fainter 10 HR2 > 0.3 sources are well fitted 
by hard power-law models with I’< 1. Previous studies!° with Chandra also found 
that faint mCVs are readily fitted by power laws with '~ 0-1. 

In contrast, inside r= 1 pc, the five HR2 < 0.3 sources with C> 200 are well 
fitted by a power-law model with [°~ 1.5-2 and no apparent iron lines. Extended 
Data Fig. 1b shows the spectrum of a bright HR2 < 0.3 source with C= 267 fitted to 
a power-law model with = 1.9 + 0.3. Of the seven HR2 > 0.3 sources, similarly to 
the case of r> 1 pc sources, the five bright sources are well fitted by a thermal model 
with broad iron lines and the two fainter sources by a power-law model with I’< 1. 

Using the logN-logS values measured for both of the soft sources, and a 
much larger sample of hard sources (IPs) in the Galactic Centre (N(>S)=kS~°, 
a= 1.4)*°, we expect around four soft and eight hard sources inside r= 1 pc to 
have sufficient counts for spectral analysis. These numbers are roughly consistent 
with the five soft and seven hard sources inside r= 1 pc that we found that are 
amenable to spectral fitting. 
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MARX simulation. To validate our source/background extraction method and the 
subsequent spectroscopy and colour photometry, we employed MARX simulations 
followed by AE analysis. MARX simulation allows us to input sources, represent- 
ing mCVs and BH-LMXBs, on top of the diffuse, gaseous structures in order to 
assess whether we can recover the proper input spectrum and the input HR2 hard- 
ness ratios within statistical uncertainties. We generated fake point sources with 
input power-law spectra with [’= 1.5-2. Input X-ray fluxes were adjusted to give 
C= 100-200 in the 2-8-keV band, similar to those of the 12 HR2 < 0.3 sources 
for r< 1 pc (Extended Data Table 1). In each MARX simulation, 5-10 fake sources 
were added to all 45 Chandra observations at various source-free locations within 
0.2 pc<r<1 pc including the ridge detected by ref. 40 (representing one of the 
highest background regions in r =0.2-3.8 pc). This is the most realistic simulation 
because we take into account the actual background, vignetting, point spread func- 
tion and dithering effects for all 45 Chandra observations used for data analysis. A 
total of 124 hard and soft sources were run at 32 positions at r < 1 pc. The MARX 
sources were placed as close as possible to the observed sources whenever feasible. 
Often a MARX simulation was run repeatedly at a given position to build up sta- 
tistics on photometry and spectral recovery in the face of counting statistics. We 
analysed the simulated Chandra data using AE, calculated hardness ratios of 
the fake sources and performed spectral fitting with XSPEC. We recovered the 
(soft) input HR2 values within statistical errors for 65 out of 67 simulated sources 
with C = 100. Thus, the probability of misidentifying an intrinsically soft source 
(HR2 < 0.3) as a hard source (HR2 > 0.3) is about 3%. More importantly, the prob- 
ability for hard sources (with IP thermal spectra) with C = 100 being misidentified 
as soft sources was about 2% (56 out of 57 simulated hard sources). Thus, at r < 1 pe, 
the mean number of soft sources turning into hard sources is about 0.4 and the 
mean number of hard sources turning into soft sources is about 0.3. For fainter 
sources with C < 50, the false probability increases to more than about 20%. Thus, 
we conclude that the threshold for reliable, spectroscopic identification between 
non-thermal X-ray sources and IPs is C= 100 throughout the 0.2 pe< r <3.8 pe 
region we analysed. 

Scattering effects on spectral analysis. Owing to dust scattering of X-ray 
photons, a point source can appear with a diffuse halo that extends beyond a 
typical source-extraction region*". Dust scattering haloes have been detected from 
Chandra observations of bright X-ray transients (such as AX J1745.6—2901” and 
Swift J174540.7—290015*%). Because the dust scattering is more substantial at low 
energy, the X-ray spectrum of a source may appear harder if scattered photons 
are not fully collected within an extraction region. For example, a non-thermal 
power-law X-ray spectrum could be hardened by up to AI’< 0.5 when Ny > 1077 
cm * (ref, 44). Unlike bright X-ray transients, no dust scattering halo is observed 
from the (fainter) Chandra X-ray sources because it is buried under diffuse X-ray 
background in the Galactic Centre. 

To assess the effects of dust scattering, we applied a spectral model*”. The model 
parameters, such as grain sizes and types, column densities and distances of dust 
layers, have been uniquely determined by fitting the Chandra radial profiles of the 
outbursting NS-LMXB AX J1745.6—2901, which lies 1.5 arcmin away from Sgr A*. 
The model requires a foreground dust layer in the spiral arms a few kiloparsecs 
away from the Galactic Centre and another layer closer to AX J1745.6—2901. The 
foreground dust layer with Ny ~ 1.7 x 10” cm * is likely to lie in the line of sight 
for other X-ray sources within a few arcmin of Sgr A* (ref. 42). 

We simulated Chandra ACIS-I spectra that correspond to our source-extraction 
size (typically 1”) using various input spectral models, including a power-law 
with /’~ 1-2.5 (representing non-thermal sources) and a partially covered APEC 
model with kT'~ 20 keV (representing typical IP-like spectra). For each spectral 
model, we simulated Chandra ACIS-I spectra with and without the multiplica- 
tive fgcdust scattering model in XSPEC. Then, we calculated HR2 hardness ratios. 
We also fit power-law (tbabs*powerlaw) and thermal (tbabs*pcfabs*apec) models 
to the simulated spectra of non-thermal and thermal sources, respectively. The 
differences in the best-fitting HR2 values, power-law indices and temperatures 
between the simulated Chandra spectra with and without the dust scattering model 
provide an estimate for how much spectral hardening could be caused by typical 
dust scattering in the Galactic Centre. Regardless of the input model, thermal or 
non-thermal, dust scattering hardens HR2 by AHR2=0.1. Thus, our hardness 
selection using HR2 is relatively unaffected by dust scattering for the relevant 
spectral types. The net effect of dust scattering is to harden non-thermal and 
thermal Chandra spectra by A'=0.3 (consistent with previous results“) and 
AkT=0.5 keV, respectively. 

Stacked spectra analysis. That the HR2 cut produces two distinct source 
populations is further supported by an analysis of stacked Chandra ACIS spectra 
in the central parsec. Using the combine_spectra command in the Chandra analysis 
software CIAO, we combined source and background spectra files for each group 
of the HR2 < 0.3 and HR2 > 0.3 sources. We excluded several sources from the 
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stacked spectra because their very high background counts are dominant and so 
lower the signal-to-noise ratios. Figs 2a and b in the main text show the stacked 
spectrum of the 8 HR2 > 0.3 sources with strong Fe lines and 12 HR2 < 0.3 sources, 
respectively. In both cases, all sources have net counts C 2 100 and they are located 
in the central parsec. 

First, we fitted an absorbed power-law model (thabs*powerlaw) to both stacked 
Chandra spectra. To accurately determine the spectral parameters and column 
density Ny, we applied the multiplicative fecdust dust scattering model*!. The 
stacked Chandra spectra of the HR2 < 0.3 sources in the central parsec are well 
fitted 63 = 0.81, d.o.f. =51) by an absorbed power-law model yielding P= 1.9793 
and Ny = 0.8")'5 x 1073 cm~? (Fig. 2b). 

We found that the 14 HR2 > 0.3 sources at r <1 pc divide into two subgroups 
with different spectral characteristics: 8 of them exhibit strong Fe lines in their 
Chandra spectra, while the other 6 sources show either no or weak Fe lines. An 
absorbed power-law model fit to both stacked spectra yields a hard photon index 
of I~ 0.6-0.7. The hard photon index (I’1) isa common signature of IPs*°. The 
power-law model yields a poor fit to the stacked spectra of the 
8 HR2 > 0.3 sources with strong Fe lines x? = 1.90 for 39 d.o.f.). 

Since the IP spectra are physically best modelled as emission from an optically 
thin thermal plasma, we employed the APEC model in XSPEC. We added a 
partial-covering absorption model (pcfabs) to account for photo-absorption in the 
accretion curtain or X-ray reflection from the white dwarf surface and a Gaussian 
line component for the neutral Fe-K fluorescence line’*. Using this fiducial IP 
spectral model, that is, fycdust*tbabs*fcfabs*(apec+-gauss), the stacked Chandra 
spectra of the 8 HR2 > 0.3 sources with strong Fe lines fit well, yielding the best-fit 
temperature of kT = 6.311'$ keV (Fig. 2a), while the stacked Chandra spectra of 
the 6 HR2 > 0.3 sources with no Fe lines fit to a significantly higher temperature 
(kT =19_ 9 keV). In the latter case, the temperature is not well constrained owing 
to the lack of Fe lines in the Chandra energy band below 8 keV. It is likely that the 
HR2 > 0.3 sources are composed of 8 IPs with kT ~ 5-8 keV (with Fe lines) and 6 
IPs with higher temperatures where the Fe complex is over-ionized. 

The overall stacked spectrum of all 14 HR2 > 0.3 sources fits to a partially 
covered thermal model well (x? = 0.80, d.o.f. =58), yielding the best-fit temperature 
of kT=7.3'}$ keV (Extended Data Fig. 7). The best-fit Fe abundance 


Age = 0.6'9'3 (relative to solar) is consistent with previous spectral fits for the 
unresolved, diffuse IP component of the CHXE®”. Our results from the best-fit 
models are summarized in Extended Data Table 2. The best-fit plasma temperature 
of kT = 7.317'3 keV represents an ensemble of IPs with different temperatures and 
it is consistent with the stacked spectra of the HR2 > 0.3 sources for 1 pce<r<3.8 pe 
(kT = 5.57 0$ keV) as well as the results from ref. 6 (kT =7.5}'> keV) and ref. 15 
(kT 7-9 keV). 

Hard and soft spectrum background systematics. Here we address whether the 
spectral difference between the soft and hard sources could be due to non-uniform 
background in the Galactic Centre region or to thermal hotspots in the diffuse 
emission. Extended Data Fig. 8 shows the stacked background spectra for both 
the HR2 < 0.3 sources and HR2 > 0.3 sources (with C> 100). They do not fit a 
power-law model with large e = 3.4-3.6 owing to the emission lines at 2-4 keV 
(from low-Z elements) and 6-7 keV (from Fe). Both background spectra fit 
well with an absorbed two-temperature APEC model yielding xe w= 1.2-1.4. 
The best-fit temperatures are consistent between the two background 
spectra: kT, = 0.9379 ‘94 keV and kT, =5.9'3'5 keV (for the soft source back- 
ground) and kT, = 1.1870’); keV and kT) = 6.6}; keV (for the hard source 
background). Thus the spectrally distinct nature of the HR2 < 0.3 and HR2 > 0.3 
sources cannot be attributed to spectral variations in the background, and their 
backgrounds are not consistent with non-thermal spectra. 

Spurious power-law spectra could result from subtracting thermal background 
spectra from ‘hotspots. These point-like hotspots are possibly due to small-scale 
density variations (about 0.1 pc) in the (thermal) diffuse emission in the Galactic 
Centre. They are speculated"! to be the origin of a thermal point source excess for 
low-flux sources in the Galactic Centre. These should become significant only at 
fluxes that corresponds to C< 40, about 3-7 times below the net counts of the 
HR2 < 0.3 sources. Nevertheless, we investigated the possibility that subtracting a 
similar background spectrum from a hotspot thermal spectrum could produce a 
power-law spectrum. Using the measured background for the HR2 < 0.3 sources, 
XSPEC simulations were run for simulated thermal sources with plasma temper- 
atures (kT) =3.5-8 keV) and abundance (Z; = 0.3-1.5) encompassing those 
observed in the central few parsecs. The source normalization was adjusted to 
correspond to our observed source net counts. Using the same rebinning and fitting 
procedures as for our overall analysis, we found that the stacked spectrum for the 
12 simulated thermal hotspots was indeed a power law, but with best-fit photon 
indices of "= 2.5-4, steeper than our observed HR2 < 0.3 sources (= 1.9'9'3). 
We could only get closer to our observations by adjusting the net counts to be well 


below our 100-count threshold. And in almost all cases including the best-fit 
photon indices [°~2.5, the spectra showed strong emission lines at 2-4keV and 
6-7 keV (see typical example in Extended Data Fig. 9). Consequently, all the fits 
were poor with an absorbed power-law model (x? ~ 1.4-1.8 with approximately 
50-60 d.o.f.) and, owing to the emission lines, looked nothing like our featureless 
spectra of the HR2 < 0.3 sources. 
Variability study. For the 12 HR2 < 0.3 point sources within r< 1 pc, we eval- 
uated their X-ray variability by applying the Bayesian-Block (BB) algorithm*® to 
the unbinned photon arrival times after removing time gaps between the Chandra 
observations spanning around 12 years. No exposure map or vignetting effect 
correction is required because we verified that none of the sources at r <1 pc are 
near the chip gaps and all 45 observations were pointed at Sgr A*. The BB algorithm 
is more sensitive to short-term variability than the KS test, which evaluates the sta- 
tistical significance of variability over the full time interval. Our analysis followed 
ref. 46. Among the 12 HR2 <0.3 X-ray sources, 6 sources (50%) were variable at 
the 90%-99% confidence level (Extended Data Table 1). Since the BB algorithm 
works on source and background events, we did an independent timing analysis on 
nearby regions to ensure that variability was not due to time varying background. 
Although BH-LMXBs and rMSPs both exhibit non-thermal X-ray spectra, X-ray 
variability can be used to distinguish between them. Most (around 70%) of the 
known BH-LMXBs observed in the quiescent state on multiple occasions have 
shown evidence of X-ray flux variability, many by a factor of 2—5 over a times- 
cale of days to years, owing to accretion”’. On the other hand, rMSPs display no 
such long-term variability. Whereas a majority of rMSPs emit thermal X-rays 
with blackbody temperatures of kT < 0.3 keV originating from the polar caps, 
non-thermal X-ray emission has been detected from about 30% of rMSPs744748, 
The population of rMSPs with only soft thermal emission, if they are located in 
the Galactic Centre, are not observable in the X-ray band because the soft thermal 
X-ray emission of this population will be completely absorbed owing to high col- 
umn density Nj 107? cm’. The population with non-thermal emission, which 
represents just over one-third of all approximately 50 known rMSPs detected in the 
X-ray band, exhibit power-law spectra with = 1-1.5 and Lx ~ 10°°-10* ergs}. 
The non-thermal X-ray emission from rMSPs originates either from the magne- 
tosphere or from an intra-binary shock (for rMSPs in binary systems)”***. Orbital 
variability has sometimes been detected owing to eclipsing from the companion“. 
Isolated MSPs do not show X-ray variability on timescales greater than their mil- 
lisecond spin periods because non-thermal X-rays are emitted from the magne- 
tosphere”‘. On the other hand, non-thermal X-ray emission from binary rMSPs 
often exhibits modulation over the orbital periods of these sources (hours to a few 
days); but these sources do not show the long-term variability that is observed 
for many of the Chandra point sources in the Galactic Centre. Thus, the detec- 
tion of long-term variability over months or years in the Chandra sources can be 
used as a strong indicator against rMSPs. On the other hand, timing studies of 
nearby qBH-LMXBs show that about 30% of them do not show variability and 
one would expect an even larger fraction to appear ‘steady’ at the great distance of 
the Galactic Centre, where variability is harder to detect and poorly constrained in 
fields of high background. Based on this variability argument, one-half or less of the 
HR2 < 0.3 sources may be rMSPs. 
Population analysis. Flaring emission from the coronae of main-sequence stars 
has been proposed previously to account for a substantial fraction of the soft-X-ray 
emission in the Galactic Centre, but only in the central approximately 0.1 pc (ref. 22). 
The mean luminosity of these stars in the quiescent state is more than three orders 
of magnitude below our sensitivity limit and, even when flaring, a factor of about 
30 below our sensitivity limit (10°°-10*° erg s!)?. The active stellar binaries (RS 
CVn, BY Dra and Algols) are similarly excluded because even objects in the bright 
RS CVn class have a luminosity of about 10°°-107! ergs” | more than about three 
times below the detection threshold”*. Moreover, because the stellar mass inside 
1 pc is 40% less than that in the 1 pe < r<3.8 pc annulus and X-ray emitting 
stars are not present in the annulus, the contribution of stars inside 1 pc must be 
negligible. 
Spatial morphology. We extracted the power-law index for the three-dimensional 
density n(r) of soft (HR2 < 0.3) sources by using the assumed form n(r) =kr~7 and 
projecting this along the line of sight to obtain a best fit to the surface density 
(R)=kR-®. The appropriate integral is 5(R) = 2 fren(r? — R?)-1/2dr, where 
rand R are the actual and projected distances from Sgr A*, respectively. We used 
a Python fitting routine in the scipy.optimize.curve_fit package. For the RBH-LMXB 
candidates this gives y= 2.4) 3. If the steady rMSP candidate sources are 
removed, a slightly shallower cusp results, with y= 2.0°)7. We also included 
fainter sources with C > 50 to estimate the radial extent of the soft source distribu- 
tion without the rMSP candidates. This yields a power-law index of y= 1.592. 
The stellar cusp was recently detected” in the population of old stars in the Galactic 
Centre and at a comparable projected distance range to that explored here, but had 
an associated power-law fit with a comparable cusp of y= 1.5. 
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Total number of non-thermal sources in the central parsec. We used the logN- 
logS plot in Extended Data Fig. 2 to estimate the total number of non-thermal 
sources. The extrapolation to fainter sources below the Chandra flux threshold was 
based on a previously reported intrinsic luminosity distribution®®. The slope of 
this distribution (a = 1.4*9'}) is in agreement, within errors, with that measured 
for the qBH-LMXB candidates in the central parsec (a = 1.8*)'5). A cross-check 
on the logN-logS results can be obtained by using the measured slope for the soft, 
r<1 pc sources to calculate the mean luminosity per source, assuming that the 
logN-logS plot extends down to the faintest qBH-LMXB observed™ locally. We 
have L(total) /N = J[NQ)LAL/ [NLL where N(L) = dN(>L)/dL and the lim- 
its of integration are set by the previously used value*® of Lyin and the intrinsic 
luminosity of the brightest soft source that we observed in the central parsec. This 
gives (L) =4 x 10°! ergs” for the soft sources of the central parsec, in good agree- 
ment with the previously observed®? mean luminosity of 5 x 10°! erg s-'. However, 
the estimate of the total non-thermal source population should be viewed with 
caution. It is based on the value of Lyin of a small sample (18) of relatively nearby 
black-hole binaries and it is unclear how representative these are of the qBH- 
LMXBs of the Galactic Centre. As an example, increasing Lmin by a factor of two 
would reduce the extrapolated number of sources by a factor of about three. 
Although we cannot rule out that the steady sources that comprise one-half 
of our non-thermal sources in the central parsec are rMSPs, we might expect 
them to have a more extended spatial distribution, because neutron stars receive 
substantial kick velocities on formation. Even when situated in binaries, and taking 
into account the higher escape velocity in the Galactic Centre, a more extended 
distribution, beyond 1 pc, for the rMSP population is likely. However, because 
rMSPs are an old population, those that have escaped the centre may be located 
at much larger distances than the 3.8 pc to which our analysis extends. Those in 
the central parsec are the fraction that did not have sufficient kinetic energy to 
escape. In fact, all of our observations are within the approximately 3-pc influ- 
ence radius in the Galactic Centre. Detailed modelling is required to resolve these 
questions. 
Transient emission from the qBH-LMXB. Four X-ray binary transients were 
previously reported in the central parsec, representing an over-density per unit 
stellar mass of about 20 times compared to the surrounding tens of parsecs”’. 
Subsequent work established that three of those transients are NS-LMXBs, leaving 
one potential BH-LMXB candidate in the central parsec, which has had only its 
original large outburst in more than a decade of monitoring'”'®. More recently, 
several new transients that are strong BH-LMXB candidates have been identified 
in the Galactic Centre (K.M., C.J.H., D. Haggard, B.J.H., C. Jin, S. Mandel, M. 
Nynka, G. Ponti and J. Tomsick, manuscript in preparation). If qBH-LMXBs have 
large outburst recurrence times of about 100 years, then in more than 10 years of 
monitoring 40 or more outbursts would have been expected from the extrapolated 
qBH-LMXB population, which is clearly inconsistent with observations. If the 
recurrence time is about 1,000 years, then there is rough consistency between the 
number of BH-LMXB transients observed in the central parsec over the more 
than 10 years of monitoring, and the non-detection of large outbursts among 
the dozen X-ray binaries reported here. Such a long recurrence time may not be 
implausible. Observations of the local qBH-LMXB population reveal a correla- 
tion between luminosity and orbital period, such that the sources below our flux 
threshold all have orbital periods P of less than about 10h. But according to the 
accretion disk instability model, very faint systems with P less than about 10h are 
precisely those whose accretion rate is below the critical rate required to induce 
large outbursts. If they are accreting at around 10~° times the critical accretion rate, 
then recurrence times of around 1,000 years are realized. The qBH-LMXBs above 
our detection threshold are both brighter and, according to the Lx-P correlation, 
should have longer periods. They would therefore burst at a higher rate, but still 
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be consistent with the non-observation of large outbursts from their locations. 
A similar argument, based on a low accretion rate leading to a lack of recurrent 
outbursts, has been used to explain the inefficiency of X-ray searches in uncov- 
ering black hole-Be binaries*!. No such problem arises for rMSPs, which do not 
have outbursts. 

Code availability. We used standard software for all our data analysis: the Chandra 
Interactive Analysis of Observations (CIAO; http://cxc.harvard.edu/ciao/), the 
MARX simulator (http://space.mit.edu/CXC/MARX/), the ACIS Extract package 
(http://personal.psu.edu/psb6/TARA/AE. html) and NASA’ HEASOFT software 
(https://heasarc.nasa.gov/lheasoft/). 

Data availability. The Chandra data that support the findings of this study are 
available from NASA's HEASARC data archive (https://heasarc.gsfc.nasa.gov/docs/ 
archive.html). The data shown in the figures and tables are available from the 
corresponding author on reasonable request. 
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Extended Data Figure 1 | Example Chandra spectra of thermal and 
non-thermal X-ray sources. a, CXO J174541.02—290017.6 (HR2 > 0.3). 
b, CXO J174539.48—290045.8 (HR2 < 0.3). The hard (thermal) source 
spectrum, which exhibits a broad emission line at E= 6-7 keV, is fitted 


by a partially covered APEC model, whereas the soft (non-thermal) 
source spectrum is fitted by an absorbed power-law model. The error bars 
represent 1a statistical uncertainties. The bottom panels show residuals 
(data minus model) in terms of 1o significance. 
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Extended Data Figure 2 | A cumulative X-ray flux distribution of the shown as a solid blue line. An extrapolation of this line to the minimum 
qBH-LMXB candidates within 1 pc. A logN-logS histogram of the observed flux (1.9 x 10~'° erg cm~? s“!, after correcting for higher X-ray 
12 HR2 < 0.3 sources (assuming all are qBH-LMXBs) is plotted, where S extinction in the Galactic Centre) of the local qBH-LMXB population” is 
denotes an absorbed 2-8-keV flux. The best-fitting power-law model is used to estimate the total number of gBH-LMXBs in the central parsec. 
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Extended Data Figure 3 | Hardness ratio for X-ray point sources with into account statistical and systematic errors in the HR2 data; the error 
C= 100 as a function of the projected radial distance from Sgr A*. bars in the figure indicate 1o statistical uncertainties. The hypothesis 
The horizontal line denotes the best fit to a radially constant hardness ratio _of spatial constancy of HR2 is thus rejected with P= 1.1 x 107. The 
(HR2 = 0.53), which yields a poor fit with x, = 3.0 for 89 d.o.f. We took bifurcation in HR2 at r< 1 pc is clearly visible. 
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Extended Data Figure 4 | Chandra images of the soft source CKO 
J174540.79—290024.5 and its extracted background events. 

a, Neighborhood image around the source (labelled 26, with multiple 
green circles near the centre indicating source extraction regions for 
different observations) along with other Chandra catalogue sources 
indicated by numbers 23, 24 and 27. b, Background image around the 


2.5 3 3.5 4 4.5 é 


source (labelled 26 and located near the centre with a green circle) 
from ObsID 11843. The red dashed circle indicates the inner circle of 


the background annulus (r= 2.5”). Note that background was properly 


extracted from an annulus of r~2.5’—-4” by excluding the nearby Chandra 
sources indicated by green circles labelled 23, 24 and 27. 
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Extended Data Figure 5 | Colour-colour diagram of X-ray sources 

with C2 100 within radial projected distances from Sgr A*. HR2 

and HR3 hardness data are plotted for sources at r< 1 pc (cyan) and 

1 pce<r<3.8 pc (red). The error bars represent 1o statistical uncertainties. 
There is no clear evidence of HR3-HR2 correlation, a surrogate for 
column-density- or scattering-related effects. The vertical dotted grey line 


0.6 0.8 1.0 1.2 


HR2 


indicates our dividing line at HR2 =0.3 between the soft and hard sources. 
For reference, the calculated HR2 and HR3 for four local IPs, whose 
spectra were extrapolated to the distance of Sgr A*, are shown as black 
diamonds. They cluster at high HR2, as expected. Only three of the four IP 
diamonds are visible because two of them overlap. 
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Extended Data Figure 6 | Median photon energy in the spectrum versus 
hardness ratio of X-ray sources with C > 100. Sources for r< 1 pc and 
r > 1pc are indicated in cyan and red, respectively. The median m = Qso 
and the ratio of two quartiles Qys/Q7s are calculated for each source!*. The 


energy scale across the top shows the median energy values (E50%). The blue 
and red lines correspond to hydrogen column density Ny and power-law 
photon index I’ grids, respectively. The error bars represent 1o statistical 
uncertainties. 
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Energy (keV) 
Extended Data Figure 7 | Stacked Chandra ACIS-I spectrum of the represent lo statistical uncertainties. The bottom panels show residuals 
14 thermal X-ray sources with net counts C > 100 within r < 1 pc. (data minus model) in terms of 1o significance. The best-fit parameters 
The spectrum is well fitted with a partially covered thermal APEC model are shown in Extended Data Table 2. 


with a Gaussian line at 6.4 keV Ca = 0.80 for 58 d.o.f.). The error bars 
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Extended Data Figure 8 | Stacked background Chandra ACIS-I spectra 
of the sources with net counts C > 100 within r < 1 pe. a, 12 HR2<0.3 
sources. b, 14 HR2 > 0.3 sources. Several prominent emission lines are 
present from low-Z elements (below 4 keV) and Fe (at 6-7 keV). Both 
background spectra are well fitted with an absorbed two-temperature 
thermal APEC model: x, = 1.21(291 d.o.f.; soft source background) and 
x= = 1.26 (320 d.o.f; hard source background). The best-fit temperatures 
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are consistent between the two background spectra: kT, = 0.937) '9y keV 
and kT) = 5.9*5'3 keV (for the soft source background) and 

kT, = 1.187009 keV and kT, = 6.67]; keV (for the hard source 
background). The error bars represent Io statistical uncertainties. 


The bottom panels show residuals (data minus model) in terms of 
lo significance. 
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Extended Data Figure 9 | Simulated Chandra ACIS-I spectrum using 
a two-temperature thermal APEC model. The input parameters are Be 

kT, = 1.0 keV and Z, = 2.5 (for the lower-temperature component) and significantly softer than that of the stacked soft source spectrum in 
kT, =5.0 keV and Z, = 0.6 (for the higher-temperature component). Fig. 2b (= 1.9703). 

The spectrum was poorly fit with an absorbed power-law model yielding , 


x? = 1.65 (56 d.o.f.; hard source background) and showing large residuals 
at 2-3 keV and 6-7 keV. The best-fit photon index (= 3. 110-4) is 
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Extended Data Table 1 | The 12 non-thermal (soft) X-ray sources with C > 100 in the central parsec 
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Sandan Projected Distance Net counts Long term Wavdetect 
from Sgr A* [pc] (2-8 keV) = X-ray variability Significance (c) 
174539.87-290034.2 0.23 261 TRUE 15.6 
174540.38-290033.5 0.32 100 FALSE 4.6 
174540.40-290024.1 0.32 301 FALSE 15.2 
174540.45-290036.3 0.43 171 TRUE 8.1 
174540.79-290024.5 @.52 193 TRUE 24.2 
174539.40-290040.9 0.60 309 TRUE vale 
174540.95-290031.2 0.60 274 FALSE 14.7 
174541.03-290026.8 0.65 122 FALSE 15.4 
174540.63-290013.4 0.73 130 FALSE 6.7 
174539.48-290045.8 0.75 268 TRUE 32.6 
174540.37-290049.9 0.88 136 FALSE 10.8 
174539.28-290049.1 0.92 136 TRUE 8.4 


The last column indicates the source detection significance calculated by wavdetect, a wavelet algorithm used widely for Chandra analysis. The sources have HR2 <0.3. 
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Extended Data Table 2 | Best-fitting parameters from fitting the stacked Chandra spectra 


Parameters Hard Sources (HR2 > 0.3) Soft Sources (HR2 < 0.3) 
Model fgcdust*tbabs*pcfabs*(apec+gauss)  fgcdust*tbabs*powerlaw 
Ni [1023cm’2] (Galactic) Li 0.8.93" 

Nu [107%cm°?] (Partial covering) 204% = 

Covering fraction 0.958" a 

kT [keV] 7.343 = 

Are 0.655" = 

r a! 1.93 

2-8 keV flux [10°7erg cm? s] (absorbed) BTS 5.945 

2 (dof) 0.80 (58) 0.81 (51) 


The errors indicate 1c statistical uncertainties. The spectral models are a partially covered APEC model with a Gaussian line and an absorbed power-law model for the 14 hard (HR2 >0.3) and the 
12 soft sources (HR2 < 0.3), respectively. The line energy and width of the Gaussian line component are fixed to 6.4 keV and 0.01 keV to account for the neutral Fe-K fluorescence line. The parameters of 
the partial covering absorption component are poorly constrained owing to spectral fitting in the narrow 2-8-keV energy band. 
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Quantized Majorana conductance 
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Jouri D. S. Bommer!, Michiel W. A. de Moor!, Diana Car?, Roy L. M. Op het Veld’, Petrus J. van Veldhoven®, Sebastian Koelling?, 
Marcel A. Verheijen*°, Mihir Pendharkar®, Daniel J. Pennachio*, Borzoyeh Shojaei*’, Joon Sue Lee’, Chris J. Palmstrom*®’, 
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Majorana zero-modes—a type of localized quasiparticle—hold 
great promise for topological quantum computing!. Tunnelling 
spectroscopy in electrical transport is the primary tool for 
identifying the presence of Majorana zero-modes, for instance 
as a zero-bias peak in differential conductance’. The height of 
the Majorana zero-bias peak is predicted to be quantized at the 
universal conductance value of 2e?/h at zero temperature’ (where 
e is the charge of an electron and h is the Planck constant), as a 
direct consequence of the famous Majorana symmetry in which a 
particle is its own antiparticle. The Majorana symmetry protects 
the quantization against disorder, interactions and variations 
in the tunnel coupling*°. Previous experiments®, however, have 
mostly shown zero-bias peaks much smaller than 2e7/h, with a 
recent observation’ of a peak height close to 2e7/h. Here we report a 
quantized conductance plateau at 2e”/h in the zero-bias conductance 
measured in indium antimonide semiconductor nanowires covered 
with an aluminium superconducting shell. The height of our zero- 
bias peak remains constant despite changing parameters such as the 
magnetic field and tunnel coupling, indicating that it is a quantized 
conductance plateau. We distinguish this quantized Majorana peak 
from possible non-Majorana origins by investigating its robustness 
to electric and magnetic fields as well as its temperature dependence. 
The observation of a quantized conductance plateau strongly 
supports the existence of Majorana zero-modes in the system, 
consequently paving the way for future braiding experiments that 
could lead to topological quantum computing. 

A semiconductor nanowire coupled to a superconductor can be 
tuned into a topological superconductor with two Majorana zero- 
modes localized at the wire ends!*’. Tunnelling into a Majorana mode 
will show a zero-energy state in the tunnelling density-of-states, that 
is, a zero-bias peak (ZBP) in the differential conductance (dI/dV)**. 
This tunnelling process is an ‘Andreev reflection, in which an incom- 
ing electron is reflected as a hole. Particle-hole symmetry dictates 
that the zero-energy tunnelling amplitudes of electrons and holes are 
equal, resulting in a perfect resonant transmission with a ZBP height 
quantized at 2e/h (refs 3, 4, 10), irrespective of the precise tunnelling 
strength? *. The Majorana nature of this perfect Andreev reflection is a 
direct result of the well-known Majorana symmetry property ‘particle 
equals antiparticle!+. 

This predicted robust conductance quantization has not yet been 
observed?*7:!3:4, Instead, most of the ZBPs have a height consider- 
ably less than 2e’/h. This discrepancy was first explained by thermal 
averaging!>-'§, but that explanation does not hold when the peak width 
exceeds the thermal broadening (about 3.5kpT)'*“, In that case, other 
averaging mechanisms, such as dissipation”, have been invoked. The 
main source of dissipation is a finite quasiparticle density-of-states 


within the superconducting gap, often referred to as a ‘soft gap. 
Substantial advances have been achieved in ‘hardening’ the gap by 
improving the quality of materials, eliminating disorder and inter- 
face roughness”?”!, and better control during device processing’, 
all guided by a more detailed theoretical understanding”*. We have 
recently solved all these dissipation and disorder issues”', and here we 
report the resulting improvements in electrical transport leading to the 
elusive quantization of the Majorana ZBP. 

Figure 1a shows a micrograph of a fabricated device and schematics 
of the measurement set-up. An InSb nanowire (grey) is partially covered 
(two out of six facets) by a thin superconducting aluminium shell 
(green)*!. The ‘tunnel-gates’ (coral red) are used to induce a tunnel 
barrier in the non-covered segment between the left electrical contact 
(yellow) and the Al shell. The right contact is used to drain the current 
to ground. The chemical potential in the segment covered with Al can 
be tuned by applying voltages to the two long ‘super-gates’ (purple). 

Transport spectroscopy is shown in Fig. 1b, which displays dI/dV 
as a function of voltage bias V and magnetic field B (aligned with the 
nanowire axis), while fixed voltages are applied to the tunnel- and 
super-gates. As B increases, two levels detach from the gap edge 
(at about 0.2 meV), merge at zero bias and form a robust ZBP. This is 
consistent with the Majorana theory: a ZBP is formed after the Zeeman 
energy closes the trivial superconducting gap and re-opens a topological 
gap*”. The gap re-opening is not visible ina measurement of the local 
density-of-states because the tunnel coupling to these bulk states is 
small’°. Moreover, the finite length (about 1.2 um) of the proximitized 
segment (that is, the part that is superconducting because of the 
proximity effect from the superconducting Al coating) results in 
discrete energy states, turning the trivial-to-topological phase transition 
into a smooth crossover”®. Figure 1c shows two line-cuts from Fig. 1b 
extracted at 0 T and 0.88 T. Importantly, the height of the ZBP reaches 
the quantized value of 2e’/h. The line-cut at zero bias in the lower 
panel of Fig. 1b shows that the ZBP height remains close to 2e”/h over a 
sizable range in B field (0.75-0.92 T). Beyond this range, the height 
drops, most probably because of a closure of the superconducting gap 
in the bulk Al shell. 

We note that the sub-gap conductance at B=0 (black curve, left 
panel, Fig. 1c) isnot completely suppressed down to zero, reminiscent 
of a soft gap. In this case, however, this finite sub- gap conductance does 
not reflect any finite sub-gap density-of-states in the proximitized wire. 
It arises from Andreev reflection (that is, transport by dissipationless 
Cooper pairs) due to a high tunnelling transmission, which is evident 
from the above-gap conductance (dI/dV for V>0.2 mV) being larger 
than e?/h. As this softness does not result from dissipation, the Majorana 
peak height should still reach the quantized value’. In Extended Data 
Fig. 1, we show that this device tuned into a low-transmission regime, 
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Figure 1 | Quantized Majorana zero-bias peak. a, False-colour scanning 
electron micrograph of device A (upper panel) and its schematics (lower 
panel). Side gates and contacts are Cr/Au (10nm/100 nm). The Al shell 
thickness is approximately 10 nm. The substrate is p-doped Si, acting 

as a global back-gate, covered by 285 nm SiO}. The two tunnel-gates 

are shorted externally, as are the two super-gates. Scale bar, 500 nm. 

b, Magnetic field dependence of the quantized ZBP in device A with 

the zero-bias line-cut in the lower panel. Magnetic field direction 

is aligned with the nanowire axis for all measurements. Super-gate 
(tunnel-gate) voltage is fixed at —6.5 V (—7.7 V), while the back-gate is 


where di/dV does reflect the density-of-states, displays a hard gap 
(also shown in Extended Data Fig. 4, where the gap remains hard in a 
magnetic field). For further understanding, we use experimental 
parameters in a theoretical Majorana nanowire model’ (see Methods 
for more information). Figure 1d shows a simulation with two line-cuts 
shown in Fig. 1c (right panel). Besides the ZBP, other discrete sub-gap 
states are visible, which are due to the finite wire length. Such discrete 
lines are only faintly resolved in the experimental panels of Fig. 1b. 
Overall, we find good qualitative agreement between the experimental 
and simulation panels in Fig. 1b and d. An exact quantitative agreement 
is not feasible, as the precise experimental values for the parameters 
going into the theory (for example, chemical potential, tunnel coupling, 
Zeeman splitting or spin-orbit coupling) are unknown for our hybrid 
wire-superconductor structure. 

Next, we fix B at 0.8 T and investigate the robustness of the quantized 
ZBP against variations in transmission by varying the voltage on the 
tunnel-gate. Figure 2a shows dJ/dV while varying V and tunnel-gate 
voltage. Figure 2b shows that the ZBP height remains close to the 
quantized value. Importantly, the above-gap conductance measured 
at |V| =0.2 meV varies by more than 50% (Fig. 2c and d), implying 
that the transmission is changing considerably over this range while the 
ZBP remains quantized. The minor conductance switches in Fig. 2a-c 
are due to unstable jumps of trapped charges in the surroundings. 

Figure 2d (red curves) shows several line-cuts of the quantized ZBP. 
The extracted height and width are plotted in Fig. 2e (upper panel) 
as a function of above-gap conductance Gy=T x e’/h where T is the 
transmission probability for a spin-resolved channel. Although the ZBP 
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kept grounded. Temperature is 20 mK unless specified. c, Comparison 
between experiment and theory. Left (right) panel shows the vertical 
line-cuts from b (d) at 0 T and 0.88 T (1.07 meV). d, Majorana simulation 
of device A, assuming chemical potential 4, = 0.3 meV, tunnel barrier 
length (Lrg= 10 nm), with height Erg = 8 meV, and the superconductor- 
semiconductor coupling is 0.6 meV. See Methods for further information. 
A small dissipation broadening term (about 30 mK) is introduced for all 
simulations to account for the averaging effect from finite temperature and 
small lock-in excitation voltage (8 1V). 


width does change with Gy, the quantized height remains unaffected. 
Note that the ZBP width ranges from about 50 1eV to about 100 1eV, 
which is significantly wider than the thermal width of approximately 
61eV at 20 mK. The ZBP width is thus broadened by tunnel coupling, 
instead of thermal broadening, fulfilling a necessary condition to 
observe a quantized Majorana peak. In Extended Data Fig. 2, we show 
that in the low-transmission regime in which thermal broadening 
dominates over tunnel broadening, the ZBP height drops below 2e7/h 
(as explained in refs 15-18). The robustness of the ZBP quantization 
to a variation in the tunnel barrier is an important finding of our work. 

A more negative tunnel-gate voltage (<—8 V) eventually splits the 
ZBP, which may be explained by an overlapping of the two localized 
Majorana wavefunctions from the two wire ends. The tunnel-gate 
not only tunes the transmission of the barrier but also influences the 
potential profile in the proximitized wire part near the tunnel barrier. 
A more negative gate voltage effectively pushes the nearby Majorana 
mode away, towards the remote Majorana on the other end of the wire, 
thus reducing the length of the effective topological wire. This leads to 
the wavefunction overlap between the two Majorana modes, causing 
the ZBP to split!® (black curves in Fig. 2d). This splitting is also 
captured in our simulations shown in Fig. 2f, where we have checked 
that the splitting originates from Majorana wavefunction overlap. Note 
that the simulated ZBP height (red curve in middle panel in Fig. 2f) 
remains close to the 2e”/h plateau over a large range, whereas the 
above-gap conductance (black curve in lower panel in Fig. 2f) changes 
substantially. Also, the height and width dependence in the simulation 
is in qualitative agreement with our experimental observation (Fig. 2e). 


5 APRIL 2018 | VOL 556 | NATURE | 75 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


) 1 
= CI/AV (2€°/h) qu! 


0 1 
J GO  _——<————| 


-0.2 
= 
—£ 0 
> 
0.2 
iS Nhe at sie a ets ct ea es 
® 
NX 
> 
gz 
S 
0 
_ 0.5 
= 
a 0.4 
3 0.3 
e V=-0.2 mV 3 . 
0.2 eV= 0.2 mV 02 |V| = 0.28 mv 
81 -80 -79 -78 -77 O44 0 
Tunnel-gate voltage (V) Tunnel barrier height (meV) 
d e g 
B=0.8T 1.0 4 o-en-p—e-ong-2—_ me V, = 0.8 meV 
a oat er 
= sf 
= (ORSy wer = 5 
s r r 0.05 F w 
ie 0.35 0.45 0.55 SS 
5S 
3 }o0o7 2 & 
i) be. 
+0.05 
7 T T 0 
0.42 0.46 0.50 -0.04 0 0.04 
Gy (2€7/h) V (mV) 


Figure 2 | Quantized Majorana conductance plateau. a, Tunnel-gate 
dependence of the quantized ZBP at B= 0.8 T. Super-gate (back-gate) 
voltage is fixed at —6.5 V (0 V). b, c, Horizontal line-cuts from a, showing 
zero-bias conductance and above-gap conductance, respectively. The 
zero-bias conductance shows a quantized plateau. d, Several vertical 
line-cuts from a, showing ZBPs with quantized height (red curves). For 
the black curves, the zero-bias conductance drops below the quantized 
value owing to peak splitting. e, (Upper panel) ZBP height (red squares) 
and width (black dots) extracted from d (red curves), as a function of 


To complete the comparison, we show in Fig. 2g the simulated line-cuts 
of several quantized ZBPs (red curves) and split peaks (black curves), 
consistent with the experimental data in Fig. 2d. 

Pushing Majorana modes towards each other is one mechanism for 
splitting. Another way is by changing the chemical potential through the 
transition from a topological to a trivial phase*°—the quantum phase 
transition from the trivial to the topological phase can equivalently 
be caused by tuning either the Zeeman energy (that is, the magnetic 
field) or the chemical potential. Splitting at the phase transition occurs 
because the Majorana wavefunctions start to spread out over the entire 
wire length. For long wires, the transition is abrupt, whereas in shorter 
wires a smooth transition is expected”®. We investigate the dependence 
of the quantized ZBP on chemical potential by varying the voltage on 
the super-gate. Figure 3a shows a nearly quantized ZBP that remains 
non-split over a large range in the super-gate voltage. More positive 
voltage applied to the super-gates corresponds to a higher chemical 
potential, and eventually we find a ZBP splitting (around —5 V or more 
positive) and consequently a suppression of the zero-bias conductance 
below the quantized value. Although the relation between the gate 
voltage and chemical potential is unknown in our devices, this splitting 
suggests a transition to the trivial phase caused by a tuning of the 
chemical potential induced by the changing super-gate voltage. 

In a lower B field and different gate settings (Fig. 3b), the splitting 
of the quantized ZBP shows oscillatory behaviour as a function of the 
super-gate voltage. The five line-cuts on the right panel highlight this 
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above-gap conductance (Gy). The width is defined by the bias voltage at 
which dI/dV = e*/h. (Lower panel) ZBP height and width extracted from 
several simulation curves in f. f, Majorana simulation of the tunnel-gate 
dependence. We set the Zeeman field Vz =0.8 meV and chemical potential 
}t= 0.6 meV, such that the nanowire is in the topological regime. From 

left to right, the barrier width decreases linearly from 175 nm to 0 nm, as 
the barrier height decreases from 2.1 meV to 0. g, Vertical line-cuts from f 
show the quantized ZBP (red) and split peaks (black). 


back-and-forth behaviour between quantized and suppressed ZBPs. 
Notably, the ZBP height comes back up to the quantized value and 
does not cross through it. 

We find similar behaviour in the theoretical simulations of Fig. 3c. In 
these simulations, we have confirmed that for the chosen parameters, 
the Majorana wavefunctions oscillate in their overlap, thus giving rise 
to the back-and-forth behaviour of quantized and split ZBPs”’. In the 
experiment, it may also be that non-homogeneity, possibly somewhere in 
the middle of the wire, causes overlap of Majorana wavefunctions. Again, 
we note that the conversion from gate voltage to chemical potential 
is unknown, preventing a direct quantitative comparison between 
experiment and simulation. 

To demonstrate the reproducibility of ZBP quantization, we show 
in Fig. 4a the quantized ZBP data from a second device. In this second 
device, the length of the proximitized section is about 0.9 tum, which is 
about 0.3 1m shorter than in the previous device. The quantized ZBP 
plateau is indicated by the region between the two green dashed lines 
in Fig. 4b (red curve). This second device allows transmission of more 
than one channel through the tunnel barrier, which we deduce from 
the above-gap conductance value (Fig. 4b, lower panel, black curve) 
exceeding e’/h for tunnel-gate voltages higher than about —0.55 V. 
Correspondingly, the zero-bias conductance can now exceed 2e7/h 
(Fig. 4b, middle panel) for such an open tunnel barrier®. Tunnelling 
through the second channel in the barrier region results in an additional 
background conductance, thus leading to the zero-bias conductance 
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Figure 3 | Majorana peak splitting. a, Super-gate dependence of the 
quantized ZBP in device A at 0.9 T. As the super-gate increases the 
chemical potential, the ZBP height is nearly quantized before it splits. 
The tunnel-gate voltage is adjusted simultaneously when sweeping the 
super-gate voltage, to compensate for the cross coupling and keep the 
transmission roughly constant. Lower panel shows the zero-bias line-cut, 
and the right panels show vertical line-cuts at gate voltages indicated by 
the corresponding colour bars. Switches in the colour maps are due to 
charge jumps in the gate dielectric. b, Oscillatory behaviour of the ZBP 
splitting, where the two black arrows point at the peak splitting regions. 
c, Simulation also shows oscillatory splitting as a function of chemical 
potential. The Zeeman field is fixed at Vz = 1 meV. 


rising above 2e7/h. We find, however, from a rough estimate of this 
background contribution that the net ZBP height (above background) 
never exceeds 2e7/h, consistent with Majorana theory?. 

We next fix the B field and study temperature dependence. Figure 4c 
shows a line-cut of this quantized ZBP from Fig. 4a. First, the base 
temperature trace in Fig. 4c (red data points) fits well to a Lorentzian 
line-shape with a 20 mK thermal broadening, expected for Majoranas*° 
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Figure 4 | Quantized Majorana plateau reproduced, and temperature 
dependence. a, Magnetic field dependence of the quantized ZBP in 
device B. b, Tunnel-gate dependence of the ZBP at 0.83 T. The two lower 
panels are the horizontal line-cuts at bias voltage, V, of OmV and 0.2 mV. 
The two dashed green lines indicate the plateau region of the zero-bias 
conductance. c, Vertical line-cuts from a at 0 T and 0.84 T. The blue line 

is a Lorentzian fit with a tunnel coupling ‘= 13.7 jveV and temperature 
of 20 mK. d, Temperature dependence of this quantized ZBP while the 
temperature increases from 20 mK to 600 mK in steps of 10 mK. e, Colour 
plot of the temperature dependence in the upper panel with the simulation 
in the lower panel. At each temperature, the conductance is renormalized 
by setting the minimum to 0 and maximum to 1, for clarity. a.u., arbitrary 
units. f, Extracted ZBP height and FWHM as a function of temperature 
from e. Upper panel is the experiment; lower panel is the simulation with 
no fitting parameters. 


as well as for any type of resonant transmission. The ZBP temperature 
dependence is shown in line traces in Fig. 4d and in colour scale 
in Fig. 4e (with the corresponding simulation in the lower panel of 
Fig. 4e). Figure 4f shows the extracted ZBP height and ZBP width 
(full-width at half-maximum, FWHM) from both the experimental 
and simulated traces. At low temperatures, the ZBP width (red data 
points) exceeds the thermal width defined as 3.5kgT (blue line). In 
agreement with theory’, the ZBP height (black data points) reaches 
and saturates at 2e”/h when the FWHM exceeds 3.5kgT. For higher 
temperatures, thermal averaging starts to suppress the ZBP height 
below the quantized value. The simulated data are calculated by a 
convolution of the derivative of the Fermi distribution function 
and the dI/dV trace at a base temperature of 20 mK. This procedure 
of incorporating thermal effects holds if the temperature of the 
calculated dI/dV curve is significantly larger than base temperature 
(which can then be assumed to be the effective zero-temperature 
conductance value). We find excellent agreement between experiment 
and simulation for T > 50 mK (Fig. 4f). See Extended Data Fig. 3 for 
detailed temperature dependence. 
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Figure 5 | Trivial zero-bias peaks from Andreev bound states. 

a, Magnetic field dependence of a trivial ZBP in device C. The peak height 
reaches 2e/h at 0.7 T (red bar). b, Back-gate dependence of this ZBP, 
where the peak remains non-split for a sizable range of gate voltage. The 
peak height varies and is generally below 2e7/h. c, Tunnel-gate dependence 
of this ZBP, which is a result of level crossing. d, Back-gate dependence 

of the ZBP at 0.35 T, with the lower panel showing the zero-bias line-cut. 
e, Vertical line-cuts from d. f, Lorentzian fits (red curves) of three ZBP 
curves (black dots) taken from e, assuming a temperature broadening of 
20mK. 


Recent theoretical work”* has shown numerically for experimentally 
relevant parameters that ZBPs can also arise from local and non- 
topological Andreev bound states (ABS)!°?"*5, These local ABS appear 
remarkably similar in tunnelling spectroscopy to the ZBPs arising 
from Majorana zero-modes. In a third device, we are able to find such 
non-topological states by fine-tuning the gate voltages (see Extended 
Data Fig. 7 for specifics of all devices). Figure 5 shows the similarities 
and differences between ABS and Majorana ZBPs. First, Fig. 5a shows 
a ZBP in tunnelling spectroscopy versus B field. At a particular B field 
(0.7 T, red bar), the ZBP height reaches 2e”/h. In this device, we next 
vary the chemical potential by means of a voltage applied to a back-gate, 
producing a fairly stable (non-split) ZBP (Fig. 5b). In contrast, the ZBP 
is unstable against variations in tunnel-gate voltage: Fig. 5c shows that 
the ZBP now appears as level crossings instead of being rigidly bound to 
zero bias. The two different behaviours between back-gate and tunnel- 
gate are expected for ABSs that are localized near the tunnel barrier, as 
was modelled explicitly in ref. 28 (see also Extended Data Fig. 5). Liu 
et al.?8 show that local ABSs can have near-zero energy, which ina B 
field is remarkably robust against variations in chemical potential, in 
our experiment tuned by the back-gate. But this is only the case for the 
tunnel-gate voltage fine-tuned to level crossing points at zero bias. The 
local tunnel-gate and the global back-gate thus have distinguishably 


78 | NATURE | VOL 556 | 5 APRIL 2018 


different effects. For the Majorana case, instead of level crossing, the 
ZBP should remain non-split over sizable changes in tunnel-gate 
voltage'*¥°, as shown in Fig. 2a and Fig. 4b. 

The second fundamental difference is that the non-topological ABS 
ZBP height is not expected to be robustly quantized at 2e7/h (refs 5, 28). 
Figure 5d and e shows that the ZBP height varies smoothly as a 
function of the back-gate voltage without any particular feature at 
2e?/h. The ZBP height in Fig. 5a at 2e*/h is just a tuned coincidence 
(see Extended Data Fig. 6). Note that the ZBP line-shape or tem- 
perature dependence does not discriminate between topological and 
non-topological cases. Both fit a Lorentzian line-shape as shown 
explicitly for the non-topological ABS in Fig. 5f. Thus, the tempera- 
ture dependence alone cannot distinguish a Majorana origin from 
ABS”*!?_ Only a stable quantized tunnel-conductance plateau, robust 
against variations in all gate voltages and magnetic field strength, can 
uniquely identify a topological Majorana zero-mode in tunnelling 
spectroscopy. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Theory model. We use the theoretical model from ref. 28 to perform numerical 
simulations with experimentally relevant parameters, such as the effective mass 
m* =0.015m,, the spin-orbit coupling a=0.5eV A, the chemical potential of the 
normal metal lead /ijeaqa= 25 meV, the Landé g-factor g= 40 such that the Zeeman 
energy Vz [meV] = 1.2B [T], and the length of the nanowire L= 1.0|1m. Note 
that the collapse of the bulk Al superconducting gap is included explicitly in the 
theory to be consistent with the experimental situation in which the bulk gap 
collapses at about 1 T. 


Lorentzian fit. We fit our ZBP line-shape with the Lorentzian formula: 


G(V) = 20° _I” _ where I’defines the tunnel coupling and FWHM of the 
h 7?+4(ev)? 

peak, that is, 22. Then we do convolution integration with the derivative of the 
Fermi distribution function (at 20 mK) to fit our ZBP shape. Because the FWHM 
of our ZBP is much larger than the thermal width, we take Ito be roughly equal 
to half of the FWHM for all the fittings in Fig. 4c and Fig. 5f. 

Data availability. The data that support the findings of this study are available within 
the paper. Additional data are available from the corresponding authors upon request. 
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Extended Data Figure 1 | Apparent ‘soft gap’ due to large Andreev 
reflection. a, Differential conductance dI/dV of the device in Figs 1-3 
(device A) as a function of bias voltage at zero magnetic field. The tunnel- 
gate voltage is tuned to more negative from the top curve to the bottom 
curve. The transmission probability of the tunnel barrier is tuned from 
large (black curve) to small (orange curve). In the low transmission regime 
(orange curve), where the above-gap conductance (about 0.03 x 2e?/h) is 
much less than 2e”/h, dI/dV is proportional to the density of states in the 
proximitized wire part, resolving a hard superconducting gap. In the high 


transmission regime (black curve), where the above-gap conductance 

is comparable with 2e”/h, the finite sub-gap conductance is due to large 
Andreev reflection. This ‘soft gap’ is not from dissipation, and does not 
affect the quantized ZBP height as shown in c. b, Re-plot of the two 
extreme curves from a, for clarity. c, Waterfall plot of Fig. 1b, showing 
all the individual curves from 0T to 1 T in steps of 0.02 T. The curves are 
offset vertically by 0.066 x 2e’/h for clarity. The curve at 0 T and the red 
curve at 0.88 T correspond to the curves in Fig. 1c (left panel). 
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Extended Data Figure 2 | Thermal-broadened ZBP inlowtransmission —_ value. The sub-gap conductance at finite B (for example, 0.88 T or 0.94T), 
regime. a, Differential conductance di/dV of device D, as a function of where the ZBP appears, is the same as the sub-gap conductance at zero 
B, showing a stable ZBP. b, Vertical line-cuts at 0 T, 0.88 T and 0.94T. At field, indicating that the gap remains hard at high magnetic field where the 
B=OT, the above-gap conductance (approximately 0.05 x 2e”/h) is much Majorana state is present. c, The zoom-in curves show that the FWHM of 
less than 2e?/h, which means that the device is in the low transmission the ZBP is about 28 j1eV, which is consistent with the combined effect of 
regime, and thus shows a hard gap. The tiny sub-gap conductance is the thermal broadening (3.5kgT © 6 eV at 20 mK), the lock-in bias voltage 
due to the small Andreev reflection and the noise background of the excitation (5 |1eV) and broadening from tunnelling. This shows that the 
measurement equipment. The low transmission leads to a narrow ZBP thermal broadening does indeed dominate over tunnel broadening. 
width, which is negligible compared with the thermal width of 3.5k,T. d, Waterfall plot of a with vertical offset of 0.01 x 2e7/h for clarity. 


Thus, thermal averaging suppresses the ZBP height below the quantized 
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Extended Data Figure 3 | Simulation of temperature dependence 
on the quantized ZBP. a, False-colour scanning electron micrograph 
of device B with data shown in Fig. 4. Scale bar is 11m. The length 
of the Al section is about 0.9 1m. We calculate the dI/dV curve at 
high temperature by convolution of the derivative of the Fermi 
distribution function with the di/dV curve at base temperature 


of 20 mK: dI/dV= G(V, T) = i deG(e, ors? where 
T is temperature, V is bias voltage, and f(E,T) is the Fermi distribution 
function. Because we use the dI/dV curve at 20 mK as the 


zero-temperature data, our model only works for T sufficiently 


larger than 20 mK, that is, T > 50 mK. b, Comparison between the 
experimental data (left, taken from Fig. 4d) and theory simulations, for 
different temperatures. c, Several typical curves at different temperatures; 
black traces are the experimental data, and the red curves are the theory 
simulations with no fitting parameters. The agreement between simulation 
and experiment indicates that thermal averaging effect is the dominating 
effect that smears out the ZBP at high temperature. d, Temperature 
dependence of the ZBP taken from our theory model: Fig. 1c (right panel). 
The temperature varies from 25 mK to 700 mK in steps of 23 mK. 
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Extended Data Figure 4 | Perfect ballistic Andreev transport in InSb-Al 
nanowires. a, False-colour scanning electron micrograph of the device in 
Fig. 5 (device C). Scale bar is 500 nm. Electrical contacts and top gates are 
Cr/Au. Lower panel shows the device schematic and measurement set-up. 
The two top-gates (tunnel-gate and super-gate) are separated from the 
nanowire by 30-nm-thick SiN dielectric. The global back gate is p-doped 
Si covered by 285-nm-thick SiO, dielectric. b, Differential conductance 
dl/dV, as a function of bias voltage (V) and tunnel-gate voltage at zero 
field. No localization effect (conductance resonances or quantum-dot- 
induced Coulomb blockade) is observed. c, Vertical line-cuts from b at 
tunnel-gate voltage of —0.18 V (lower panel) and —0.12 V (upper panel), 
showing a hard superconducting gap in the low transmission regime 
(lower panel) and strong Andreev enhancement in the open regime 
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(upper panel). d, Horizontal line-cuts from ¢ for V=0 mV (pink, sub- 

gap conductance, Gs) and V=0.45 mV (green, above-gap conductance, 
Gy). The blue curve is the calculated sub-gap conductance using 

Gs = 4e?/h x T?/(2 — T)*, where the transmission T is extracted from the 
above-gap conductance: Gy = (2e”/h) x T. e, Sub-gap conductance Gs 

as a function of Gy (black dots) and the theory prediction (red curve): 

Gs = 2Gy?/(2 — Gy)’, with Gs and Gy in unit of 2e?/h. Both d and e show 
perfect agreement between theory and experiment. This indicates that the 
sub-gap conductance is indeed dominated by the Andreev reflection, that 
is, without contributions from sub-gap states. f, Magnetic field dependence 
of the hard gap. Lower panel shows the zero-bias line-cut. The gap remains 
hard up to 1 T, where the bulk superconducting gap closes. 
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Extended Data Figure 5 | Majoranas versus trivial Andreev bound 
states. a, b, Schematics of a Majorana nanowire device. The only difference 
between the left column (Majorana) and right column (ABS) is the 
chemical potential, as shown in c and d. ¢, d, Potential profile in the 
device. The tunnel barrier height is 10 meV and the width is 10 nm. The 
dot potential shape is E(x) = —Vp sin(tx/laot), for x between 0 and 0.3 1m, 
where the length of the dot (/got) is 0.3 sm, and Vp is the dot depth which 
can be tuned by the nearby gate, that is, the tunnel-gate. The rest of the flat 
nanowire segment is 1 jm long. We assume a pairing potential 

A=0.2 meV, with a spin-orbit coupling of 0.5eV A. We set the Zeeman 
energy to be 1 meV, so the chemical potential of 0.5 meV (left) corresponds 
to the topological regime, and 1.2 meV (right) corresponds to the trivial 
regime, based on the topological condition Vz > ,/ ju? -+ A”, where ju is 
chemical potential. e, f, Spatial distribution of the Majorana and ABS 
wavefunctions in the topological and trivial regime. In the topological 
regime, two spatially well separated Majoranas (red and black) are 
localized at the two ends of the topological section. In the trivial regime, 


LETTER 


b Andreev bound states 


Superconductor 


Nanowire 


Tunnel Super 
gate —eFFFTFF—SCis gate 
St Se Se Ss 5 al 
0 0.4 0.8 1.2 
d x (um) 
10; 
= 5 
S 
a «O 
34 V,=1meV 
-10 w= 1.2 meV ; 
0 0.4 0.8 12 
x (um) 
fj 
Trivial ABS : strongly 
overlapped Majoranas 
= V,=1meV 
u=1.2 meV 
V,= 1.03 meV 
0- 
0 0.4 0.8 1.2 
X (um) 
h di/dV (2e?/h) ———EEE 
0 


Dot potential (meV) 


j 
Trivial : no plateau 
= 
ey 14--#%------z3-4 
a 3 
> 
ae) 
3 
re) 
V=OmV 
0 T T T T 
0 1 2 3 4 


Dot potential (meV) 


the Andreev bound state, which can be considered as two strongly 
overlapped Majoranas (red and black), is localized near the tunnel barrier. 
g, h, The Majorana ZBP remains non-split against the change of dot 
potential, regardless of the energy of the dot level. The green arrow 
indicates one bound state in the dot, whose wavefunction |Y| is shown in 
e (green curve). When this dot level moves down, it is repelled from zero 
energy, where the Majorana ZBP remains bound to zero (inset of i). On the 
contrary, the ABS-induced ZBP is not robust at all and only shows up at 
the crossing points of two Andreev levels. This is because the tunnel-gate 
tunes the dot potential, which therefore affects the energy of the localized 
ABS near the tunnel barrier. i, j, The Majorana ZBP height shows a 
quantized plateau at 2e/h by tuning the dot potential with tunnel-gate. 
The ZBP height drops from the quantized value (inset) when the ABS-dot 
level moves towards zero, which effectively squeezes the ZBP-width such 
that the thermal averaging effect starts to dominate. The ABS zero-bias 
conductance does not show a plateau, but instead varies between 0 and 
4e*/h. 
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Extended Data Figure 6 | Magnetic field dependence of trivial Andreev (labelled in the panels). b, Line-cuts of the ZBP data from a. The ZBP 
bound states. a, Top panel is a re-plot of the trivial ABS data in Fig. 5a. height varies with back-gate voltages and can exceed 2e”/h. The ZBP height 


Middle and bottom panels are the ZBP data at different back-gate voltages —_at 2e*/h here is just a tuned coincidence. 
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Extended Data Figure 7 | Specifics of devices. We fabricated and tested 
many (over 60) devices out of which we selected 11 devices that showed 
good basic transport with all gates being fully functional. These were used 
for extensive measurements. Although most of these devices show ZBPs 
after tuning gate voltages and magnetic field, only two devices (presented 
in the main text: Figs 1-3 for device A and Fig. 4 for device B) show a 
quantized ZBP plateau. All other devices show trivial ZBPs similar to 

Fig. 5 (from device C). Scanning electron microscope images of devices 
A, B and C are shown in Fig. 1a, Extended Data Fig. 3a and Extended 
Data Fig. 4a, respectively. Here we show the scanning electron microscope 
images of the other eight devices, which we have explored extensively, but 
without finding a quantized ZBP plateau. Devices 1 and 2 are side-gate 


LETTER 


Single layer top gate device 


Super-gate 

SiN dielectric 
Tunnel-gate 
Remote super-gate 


Substrate 


HE Back-gate 


devices. Device 3 has a top tunnel-gate separated from the nanowire by 
30-nm-thick SiN dielectric, and a global back-gate separated by 285-nm- 
thick SiO. Devices 4 and 5 have tunnel-gate and super-gate on top 
separated from the nanowire by 30-nm-thick SiN dielectric. Devices 6 to 8 
have two layers of top-gate. The bottom layer has a tunnel-gate separated 
by 30-nm-thick SiN dielectric while the top layer has super-gates separated 
by 30-nm-thick SiN from the bottom layer. The scale bar is 1 1m for all 
devices, except for device 2, which is 500 nm. It would be informative 

to perform Schrodinger—Poisson calculations on these different device 
geometries to determine the self-consistent potential landscape and 

find out which geometry suppresses a local potential dip near the tunnel 
barrier. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


doi:10.1038/nature26154 


Correlated insulator behaviour at half- filling in 
magic-angle graphene superlattices 


Yuan Caol, Valla Fatemi!, Anmet Demir!, Shiang Fang’, Spencer L. Tomarken!, Jason Y. Luo, Javier D. Sanchez-Yamagishi?, 
Kenji Watanabe’, Takashi Taniguchi’, Efthimios Kaxiras**, Ray C. Ashoori! & Pablo Jarillo-Herrero! 


A van der Waals heterostructure is a type of metamaterial that 
consists of vertically stacked two-dimensional building blocks held 
together by the van der Waals forces between the layers. This design 
means that the properties of van der Waals heterostructures can be 
engineered precisely, even more so than those of two-dimensional 
materials’. One such property is the ‘twist’ angle between different 
layers in the heterostructure. This angle has a crucial role in the 
electronic properties of van der Waals heterostructures, but does 
not have a direct analogue in other types of heterostructure, 
such as semiconductors grown using molecular beam epitaxy. 
For small twist angles, the moiré pattern that is produced by the 
lattice misorientation between the two-dimensional layers creates 
long-range modulation of the stacking order. So far, studies of the 
effects of the twist angle in van der Waals heterostructures have 
concentrated mostly on heterostructures consisting of monolayer 
graphene on top of hexagonal boron nitride, which exhibit 
relatively weak interlayer interaction owing to the large bandgap 
in hexagonal boron nitride**. Here we study a heterostructure 
consisting of bilayer graphene, in which the two graphene layers 
are twisted relative to each other by a certain angle. We show 
experimentally that, as predicted theoretically®, when this angle is 
close to the ‘magic’ angle the electronic band structure near zero 
Fermi energy becomes flat, owing to strong interlayer coupling. 
These flat bands exhibit insulating states at half-filling, which 
are not expected in the absence of correlations between electrons. 
We show that these correlated states at half-filling are consistent 
with Mott-like insulator states, which can arise from electrons 
being localized in the superlattice that is induced by the moiré 
pattern. These properties of magic-angle-twisted bilayer graphene 
heterostructures suggest that these materials could be used to study 
other exotic many-body quantum phases in two dimensions in 
the absence of a magnetic field. The accessibility of the flat bands 
through electrical tunability and the bandwidth tunability through 
the twist angle could pave the way towards more exotic correlated 
systems, such as unconventional superconductors and quantum 
spin liquids. 

Exotic quantum phenomena, such as superconductivity and the frac- 
tional quantum Hall effect, often occur in condensed-matter systems 
and other systems with a high density of states. One way of creating a 
high density of states is to have ‘flat’ bands, which have weak dispersion 
in momentum space, with the kinetic energy of the electron set by the 
bandwidth W. When the Fermi level lies within the flat bands, 
Coulomb interactions (U) can greatly exceed the kinetic energy of the 
electrons and drive the system into various strongly correlated phases 
(U/W-> 1)7"!!. The study of such flat-band systems in bulk materials 
is therefore scientifically important, and the search for new flat-band 
systems, such as in kagome and Lieb lattices and in heavy-fermion 
systems, is ongoing”~|”. 


Recent advances in two-dimensional materials have provided 
a new route to achieving flat bands. An inherent advantage of two- 
dimensional materials is that the chemical potential of electrons can 
be tuned continuously via the electric-field effect without introducing 
extra disorder. In a twisted van der Waals heterostructure, the mismatch 
between two similar lattices generates a moiré pattern (Fig. 1b). 
This additional periodicity, which can have a length scale orders of 
magnitude larger than that of the underlying atomic lattices, has been 
shown to create a fractal energy spectrum in a strong magnetic field’. 
In twisted layers, the interlayer hybridization is modulated by the 
moiré pattern as well. As an example, the band structure of twisted 
bilayer graphene (TBG) can be tailored to generate bandgaps and band 
curvatures that are otherwise absent®!*-'”. Although the well-known 
building blocks for van der Waals heterostructures, such as graphene 
and transition-metal dichalcogenides, do not have intrinsic flat bands 
at low energies, it has been predicted theoretically that flat bands could 
exist in TBG®'*"!%!8, Here we demonstrate experimentally that when 
the twist angle of TBG is close to the theoretically predicted magic 
angle, the interlayer hybridization induces nearly flat low-energy bands. 
This quenching of the quantum kinetic energy leads to a correlated 
insulating phase at half-filling of these flat bands, which is indicative 
of a Mott-like insulator in the localized flat bands. 

To zeroth order, the low-energy band structure of TBG can be con- 
sidered as two sets of monolayer-graphene Dirac cones rotated about 
the [ point in the Brillouin zone by the twist angle 6 (Fig. 1d)°. The 
difference between the two K (or K’) wavevectors gives rise to the mini 
Brillouin zone (shown as a small hexagon), which is generated from 
the reciprocal lattice of the moiré superlattice (Fig. 1d). The Dirac 
cones near either the K or K’ valley mix through interlayer hybridiza- 
tion, whereas interactions between distant Dirac cones are suppressed 
exponentially®'’. As a result, the valley remains (for all practical 
purposes) a good quantum number. Two experimentally verified con- 
sequences of this hybridization are the energy gaps that open near the 
intersection of the Dirac cones and the renormalization of the Fermi 
velocity 


A i ; | ViEk|k=K,k/ 

at the Dirac points!3 19-71. 

The theoretically calculated magic angles ae with i=1,2,...,are 
a series of twist angles at which the Fermi velocity at the Dirac points 
becomes zero®. The resulting low-energy bands near these twist angles 
are confined to less than about 10 meV. The flattening of the energy 
bands near the magic angle can be understood qualitatively from the 
competition between the kinetic energy and the interlayer hybridiza- 
tion energy (Fig. le-g). Intuitively, when the hybridization energy 2 w 
is comparable to or larger than Avokg, where vp = 10° ms“! is the Fermi 
velocity of graphene, ky Gx is the momentum displacement of the 
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Figure 1 | Electronic band structure of twisted bilayer graphene (TBG). 
a, Schematic of the TBG devices. The TBG is encapsulated in hexagonal 
boron nitride flakes with thicknesses of about 10-30 nm. The devices are 
fabricated on SiO./Si substrates. The conductance is measured with a 
voltage bias of 100 j1V while varying the local bottom gate voltage Vg. 

‘S and ‘D’ are the source and drain contacts, respectively. b, The moiré 
pattern as seen in TBG. The moiré wavelength is \= a/[2sin(0/2)], where 
a= 0.246 nm is the lattice constant of graphene and 0 is the twist angle. 

c, The band energy E of magic-angle (9 = 1.08°) TBG calculated using an 
ab initio tight-binding method. The bands shown in blue are the flat bands 
that we study. d, The mini Brillouin zone is constructed from the 
difference between the two K (or K’) wavevectors for the two layers. 


Dirac cones, Gx = 411/(3a) is the magnitude of the wavevector [-K 
of graphene, a = 0.246 nm is the lattice constant of graphene and 
h=h/(27) is the reduced Planck constant, the lower of the hybri- 
dized states is pushed to and crosses zero energy. A mathematical 
derivation of the magic-angle condition® gives the first magic angle, 
9°) |= V3 w/(AvoGx) ¥ 1.1°. In Fig. 1c we show an ab initio tight- 
binding calculation’ of the band structure for 9=1.08°. The flat bands 
(coloured blue) have a bandwidth of 12 meV for the E> 0 branch and 
2meV for the E <0 branch (where E is the band energy). From a 
band-theory point of view, the flat bands should have localized wave- 
function profiles in real space. In Fig. 1h we show the local density of 
states calculated for the flat bands. The wavefunctions are indeed highly 
concentrated in the regions with AA stacking, whereas small but non- 
zero amplitudes on the AB and BA regions connect the AA regions and 
endow the bands with weak dispersion®'>'®. A brief discussion about 
the topological structure of the bands near the first magic angle is given 
in Methods and Extended Data Fig. 1. 

For the experiment, we fabricated high-quality encapsulated TBG 
devices with the twist angle controlled to an accuracy of about 0.1°-0.2° 
using a previously developed ‘tear and stack’ technique!*'”**. We meas- 
ured four devices with twist angles near the first magic angle 
ae ~1.1°. In Fig. 2a we show the low-temperature two-probe 
conductance of device D1 as a function of carrier density n. For 
n= -+n,=+2.7 x 10'2cm (four electrons per moiré unit cell for 
6 =1.08°), the conductance is zero over a wide range of densities. Here, 
ng refers to the density that is required to fill the mini Brillouin zone, 
accounting for spin and valley degeneracies (see Methods). These 
insulating states have been explained previously as hybridization- 
induced bandgaps above and below the lowest-energy superlattice 
bands, and are hereafter referred to as ‘superlattice gaps’. The thermal 
activation gaps are measured to be about 40 meV (see Methods)'*!”. 
The twist angle can be estimated from the density that is required to 
reach the superlattice gaps, which we find to be @=1.1° +0.1° for all 
of the devices reported here. 


f 9g 
a y 
Ve = Vo = 
w=0 2w <K tok, 2w = hv ok, 


BA 


Hybridization occurs between Dirac cones within each valley, whereas 
intervalley processes are strongly suppressed. K,, K’,, M, and [’, denote 
points in the mini Brillouin zone. e-g, Illustration of the effect of interlayer 
hybridization for w=0 (e), 2w < hvokg (f) and 2w = hvoko (g); 

Vo = 10°m ss“ is the Fermi velocity of graphene. h, Normalized local 
density of states (LDOS) calculated for the flat bands with E > 0 at 

= 1.08°. The electron density is strongly concentrated at the regions with 
AA stacking order, whereas it is mostly depleted at AB- and BA-stacked 
regions. See Extended Data Fig. 6 for the density of states versus energy at 
the same twist angle. i, Top view of a simplified model of the stacking 
order. 


Another pair of insulating states occurs for a narrower density range, 
near half the superlattice density: n+ +n,/2=+1.4 x 10!*cm~? (two 
electrons per moiré unit cell). These insulating states have a much 
smaller energy scale. This behaviour is markedly different from all 
other zero-field insulating behaviours reported previously, which 
occur at integer multiples of +n, (refs 13, 17). We refer to the states that 
occur near +n,/2 as ‘half-filling insulating states. They are observed 
at roughly the same density for all four devices (Fig. 2a, inset). In 
Fig. 2b-d we show the conductance of the half-filling states in device 
D1 at different temperatures. Above 4K, the system behaves as a metal, 
exhibiting decreasing conductance with increasing temperature. 
A metal-insulator transition occurs at around 4K. The conductance 
drops substantially from 4K to 0.3K, with the minimum value decreasing 
by 1.5 orders of magnitude. An Arrhenius fit yields a thermal acti- 
vation gap of about 0.3 meV for the half-filling states, two orders of 
magnitude smaller than those of the superlattice gaps. At the lowest 
temperatures, the system can be limited by conduction through charge 
puddles, resulting in deviation from the Arrhenius fit. 

To confirm the existence of the half-filling states, we performed 
capacitance measurements on device D2 using an a.c. low-temperature 
capacitance bridge (Extended Data Fig. 2)?°. The real and imaginary 
components of the a.c. measurement provide information about the 
change in capacitance and the loss tangent of the device, respectively. 
The latter signal is tied to the dissipation in the device due to its 
resistance**, Device D2 exhibits a reduction in capacitance and strong 
enhancement of dissipation at -tn,/2 (Fig. 3a), in agreement with an 
insulating phase that results from the suppression of the density of 
states. The insulating state at —n,/2 is weaker and visible only in the 
dissipation data. The observation of capacitance reduction (that is, 
suppression of density of states) for only the n-side half-filling state in 
this device may be due to an asymmetric band structure or the quality 
of the device. The reduction (enhancement) in capacitance (dissipa- 
tion) vanishes when the device is warmed up from 0.3 K to about 2 K, 
consistent with the behaviour observed in transport measurements. 
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Figure 2 | Half-filling insulating states in magic-angle TBG. a, Measured 
conductance G of magic-angle TBG device D1 with @ = 1.08° and 
T=0.3 K. The Dirac point is located at n = 0. The lighter-shaded regions 
are superlattice gaps at carrier density n = +n, = -£2.7 x 10cm”. The 
darker-shaded regions denote half-filling states at -En,/2. The inset shows 
the density locations of half-filling states in the four different devices. 


The emergence of half-filling states is not expected in the absence of 
interactions between electrons and appears to be correlated with the 
narrow bandwidth near the first magic angle. In our experiment, sev- 
eral separate pieces of evidence support the presence of flat bands. First, 
we measured the temperature dependence of the amplitude of 
Shubnikov-de Haas oscillations in device D1, from which we extracted 
the effective mass of the electron, m* (Fig. 3b; see Methods and 
Extended Data Fig. 3 for analysis). For a Dirac spectrum with eight-fold 
degeneracy (spin, valley and layer), we expect that m* = h?n/(8nv;)s 
which scales as 1/vp. The large measured m* near charge neutrality in 
device D1 indicates a reduction in vg by a factor of 25 compared to 
monolayer graphene (4 x 10'm s~' compared to 10°m s~'). This large 
reduction in the Fermi velocity is a characteristic that is expected for flat 
bands. Second, we analysed the capacitance data of device D2 near the 
Dirac point (Fig. 3a) and found that vp needs to be reduced to about 
0.15vo for a good fit to the data (Methods, Extended Data Fig. 1b). Third, 
another direct manifestation of flat bands is the flattening of the con- 
ductance minimum at charge neutrality above a temperature of 40 K 
(thermal energy kT = 3.5 meV), as seen in Fig. 3c. Although the con- 
ductance minimum in monolayer graphene can be observed clearly even 
near room temperature, it is smeared out in magic-angle TBG when the 
thermal energy kT becomes comparable to vpkg/2 ~ 4meV—the energy 
scale that spans the Dirac-like portion of the band (Fig. 1c)?**°. 

Owing to the localized nature of the electrons, a plausible explanation 
for the gapped behaviour at half-filling is the formation of a Mott-like 
insulator driven by Coulomb interactions between electrons”””*. To 
this end, we consider a Hubbard model on a triangular lattice, with 
each site corresponding to a localized region with AA stacking in the 
moiré pattern (Fig. li). In Fig. 3d we show the bandwidth of the E>0 
branch of the low-energy bands for 0.04° < 6 < 2° that we calculated 
numerically using a continuum model of TBG®. The bandwidth W is 
strongly suppressed near the magic angles. The on-site Coulomb energy 
Uof each site is estimated to be e”/(4ned), where d is the effective linear 
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Carrier density, n (101? cm-?) 


See Methods for a definition of the error bars. b, Minimum conductance 
values in the p-side (red) and n-side (blue) half-filling states in device 
D1. The dashed lines are fits of exp[—A/(2kT)] to the data, where 
Ax0.31 meV is the thermal activation gap. c, d, Temperature-dependent 
conductance of D1 for temperatures from about 0.3 K (black) to 1.7K 
(orange) near the p-side (c) and n-side (d) half-filling states. 


dimension of each site (with the same length scale as the moiré period), 
€ is the effective dielectric constant including screening and e is the 
electron charge. Combining < and the dependence of d on twist angle 
into a single constant kK, we write U=e76/(4ne Ka), where a=0.246nm 
is the lattice constant of monolayer graphene. In Fig. 3d we plot the 
on-site energy U versus 6 for k =4-20. As a reference, «= 4 if we 
assume € = 10 and d is 40% of the moiré wavelength. For a range of 
possible values of « it is therefore reasonable that U/W > 1 occurs near 
the magic angles and results in half-filling Mott-like gaps*”. However, 
the realistic scenario is much more complicated than these simplistic 
estimates; a complete understanding requires detailed theoretical anal- 
yses of the interactions responsible for the correlated gaps. 

The Shubnikoy-de Haas oscillation frequency fsau (Fig. 3b) also 
supports the existence of Mott-like correlated gaps at half-filling. Near 
the charge neutrality point, the oscillation frequency closely follows 
fsa = G0\n|/M where $9 =h/e is the flux quantum and M =4 indicates 
the spin and valley degeneracies. However, at |n| >1,/2, we observe 
oscillation frequencies that corresponds to straight lines, fgar1 = Go(|n| 
—n,/2)/M, in which M has a reduced value of 2. Moreover, these lines 
extrapolate to zero exactly at the densities of the half-filling states, n = 
n,/2. These oscillations point to small Fermi pockets that result from 
doping the half-filling states, which might originate from charged 
quasiparticles near a Mott-like insulator phase”’. The halved degener- 
acy of the Fermi pockets might be related to the spin-charge separation 
that is predicted in a Mott insulator”. These results are also supported 
by Hall measurements at 0.3 K (Extended Data Fig. 4; see Methods for 
discussion), which show a ‘resetting’ of the Hall densities when the 
system is electrostatically doped beyond the Mott-like states. 

The half-filling states at n,/2 are suppressed by the application 
of a magnetic field. In Fig. 4a, b we show that both insulating phases 
start to conduct at a perpendicular field of B=4T and recover normal 
conductance by B=8 TT. A similar effect is observed for an in-plane 
magnetic field (Extended Data Fig. 5d). The insensitivity to field 
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Figure 3 | Flat bands in magic-angle TBG. a, Capacitance measurements 
of device D2 at 0.3 K (blue) and 2 K (pink). The change in the measured 
capacitance (AC; upper traces) is plotted on the left axis and the loss 
tangent (lower traces) is shown on the right axis. For densities 
corresponding to half-filling (+:n,/2; dashed vertical lines), a reduction in 
AC (on the p-side only) and an enhancement in loss tangent (on both 
sides) are observed in the 0.3 K data. These effects disappear in the 2K 
measurements. b, The effective mass m* and oscillation frequency fsax as 
extracted from temperature-dependent Shubnikov-de Haas oscillations. 
The fitting curve (red dashed line) is m* = ,/h?|n|/(87v2), assuming a 
uniform Fermi velocity of vp. For magic-angle device D1, the estimated 
Fermi velocity of vs=4 x 104ms_! isa factor of 25 less than that for 
pristine graphene, vp=10°m s~!. The measured oscillation frequencies 
indicate the existence of small Fermi pockets that start from the half-filling 
states, with half the degeneracy of the main Fermi surface of the Dirac 
points. Shaded regions at half-filling and full-filling correspond to the 


orientation suggests that the suppression of the half-filling states 
is due to a Zeeman effect rather than an orbital effect, because the 
latter would be affected by only the perpendicular component of 
the magnetic field. For an effective g-factor of g=2 due to electron 
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Figure 4 | Magnetic-field response of the half-filling insulating phases. 
a, b, Dependence of the conductance on the perpendicular magnetic field 
B, of the half-filling states for device D1 on the p-side (a) and the n-side 
(b). The measurement is taken at 0.3 K. c, Arrhenius plot (cirlces) of the 
conductance of the p-side half-filling state at different magnetic fields. 
The inset shows the thermal activation gap A extracted from fitting the 
data in the main plot with exp[—A/(2kT)] (solid lines). d-f, Schematics 
of the density of states (DOS) in different scenarios. The single-particle 
flat bands (E> 0 and E <0 bands are both shown, with Ey in the E> 0 


Twist angle, 0 (°) 


shaded rectangles in Fig. 2a. The error bars in m* and fgax give the 
uncertainty of fitting to the Lifshitz—-Kosevich formula (defined in 
Methods) and correspond to the 90% confidence level. The blue dashed 
curves denote the fgax that is expected for Fermi surfaces with degeneracy 
M=4and M=2, starting at charge neutrality and at the half-filling states, 
respectively. c, Gate dependence of the conductance of device D1 at 
different temperatures, 4.5 K, 8 K, 15K, 29K, 40K, 50K, 70K, 100K and 
120K. The curves are each shifted vertically by 0.006 mS for clarity. 

See Extended Data Fig. 5a, b for the temperature dependence up to 

room temperature. d, Comparison between the bandwidth W for the E>0 
flat-band branch in TBG (thick blue line) and the on-site energy U= e?0/ 
(4ne Ka) (thin coloured lines for different values of «) for different twist 
angles 0. Near the magic angles go. ic 1.19, 0.5°, ... fori=1,2,...,U>W 
is satisfied for a range of possible values of « (defined in the main text) and 
so the system can be driven into a Mott-like insulator state. 


spin, the Zeeman energy that is needed to suppress the half-filling 
states is approximately gjipB = 0.5 meV, where [ug is the Bohr magne- 
ton—the same order of magnitude as that of the thermal excitation 
energy. 
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band (n-doping); d) are split into upper and lower many-body bands by 
interactions (e). This occurs when E£; is at half-filling of the upper band. 
Upon applying a Zeeman field (B = 0), the excitations can be further 
polarized, and can close the charge gap when the Zeeman energy gyipB is 
comparable to the gap A (f). Purple shading denotes a spin-degenerate 
band, whereas blue and red shading denotes spin-up and spin-down 
bands, respectively. CNP, charge neutrality point. The shape of the DOS 
drawn here is purely illustrative and does not represent the actual DOS 
profile (see Extended Data Fig. 6 for a numerical result). 
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Our data point to the presence of a spin-singlet Mott-like insulator 
ground state at half-filling and zero magnetic field (Fig. 4e). The appli- 
cation of an external magnetic field could polarize the excitations in 
the spectrum of the correlated states according to their spin. When the 
Zeeman energy exceeds the charge gap, charge conduction can there- 
fore occur (Fig. 4f). In a typical Mott insulator, the ground state usually 
exhibits antiferromagnetic spin ordering below the Néel temperature. 
However, on a triangular lattice, the frustration prevents fully antipar- 
allel alignment of adjacent spins. Possible ordering schemes include the 
120° Néel order and a rotational-symmetry-breaking stripe order*’. It 
is unclear whether the spin-singlet ground state in magic-angle TBG 
is fulfilled by either of these ordering schemes, or whether it is dis- 
ordered at low temperatures. In the half-filling states of magic-angle 
TBG it is also possible that the ordering, if any, occurs in conjunction 
with the valley degree of freedom. Therefore, any complete theoretical 
treatment of this problem should include a two-band Hubbard model 
on a triangular lattice. 

We also comment on other competing mechanisms for creating a 
half-filled insulating state in a system with flat bands. Among the possi- 
bilities, charge-density waves in two dimensions are often stabilized by 
Fermi-surface nesting, which can in principle occur near the half- 
filling of a two-dimensional Brillouin zone*!. However, this nesting 
is not sufficient to create a global gap in the entire Fermi surface to 
achieve an insulating state. To create a global gap at half-filling, at least a 
doubling of the unit cell would be necessary, which could be created by 
a commensurate charge-density wave or lattice relaxation due to strain. 
Scanning tunnelling microscopy conducted at temperatures below 4K 
may be able to differentiate such mechanisms. 

In summary, our work demonstrates that graphene can be trans- 
formed through van der Waals engineering into a flat-band system in 
which insulating states at half-filling are present. These insulating states 
cannot be explained in the absence of electron-electron interactions 
and so highlight the importance of correlations in this flat-band system. 
However, the lattice and electronic structure near magic-angle TBG 
superlattices is very complex, and further theoretical and experimental 
work is necessary to ascertain the importance of correlation effects 
fully. Through its easy gate tunability, magic-angle TBG could provide 
a way of studying the transition between a correlated metal and an 
interaction-driven insulating state, which could provide insights into 
strongly correlated materials, including high-temperature supercon- 
ductivity. The combination of spin and valley degrees of freedom ona 
triangular lattice could also give rise to other exotic quantum phases, 
such as quantum spin liquids*”. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Sample preparation. Devices D1, D2 and D4 were fabricated using a modified 
‘tear and stack’ technique'*'”””. Monolayer graphene and hexagonal boron nitride 
(10-30-nm thick) were exfoliated on SiO./Si chips and examined with optical 
microscopy and atomic force microscopy. We used a poly(bisphenol A carbonate) 
(PC)/polydimethylsiloxane (PDMS) stack on a glass slide mounted on a micro- 
positioning stage to first pick up a hexagonal boron nitride flake at 90°C. Then 
we used the van der Waals force between hexagonal boron nitride and graphene 
to tear a graphene flake at room temperature. The separated graphene pieces 
were rotated manually by an angle @ about 0.2°-0.3° larger than the desired twist 
angle and stacked together again, resulting in a precisely controlled TBG structure. 
The TBG was then encapsulated by picking up another hexagonal boron nitride 
flake on the bottom, and the entire stack was released onto a metal gate at 160°C. 
The final device geometry was defined by electron-beam lithography and reactive 
ion etching. Device D3 was fabricated using a slightly different procedure, whereby 
independent graphene flakes were stacked together. The edges of the graphene 
flakes were aligned under an optical microscope to obtain small twist angles. 
Measurements. Transport measurements were performed using a standard 
low-frequency lock-in amplifier with an excitation frequency of about 10-20 Hz 
and an excitation voltage of 100 \V, in a He-3 cryostat. The current flowing through 
the device was amplified by a current pre-amplifier and then measured by the 
lock-in amplifier. 

The capacitance was measured using a low-temperature balanced capacitance 

bridge”®. A schematic of the measurement circuit is shown in Extended Data Fig. 2a. 
The reference capacitance C,.¢ used in our experiment was approximately 40 fF, 
and the device geometrical capacitance was approximately 7 fF. The a.c. excitation 
voltage used in our measurements was 3 mV at f= 150kHz. 
Transport data in device D4. Transport measurements in devices D1 and D3 
were performed in a two-probe configuration. Although it is generally advised to 
perform four-probe measurements in transport experiments, we found that the 
existence of multiple insulating states (the superlattice gaps at -En, and the half- 
filling states at +:n,/2) frequently led to noisy or negative longitudinal resistance 
R, signals, owing to the region in the device near the voltage probes becoming 
insulating at a slightly different carrier density. In our case, where we are mostly 
interested in insulating behaviours of the order of 100kQ to 1 MQ, a contact 
resistance of at most a few kilo-ohms, which is typical in edge-contacted graphene 
devices, did not obscure the data**. We therefore believe that the two-probe data 
presented here can be trusted and provide an accurate representation of the device 
characteristics. 

In Extended Data Fig. 4b, c we show the measurements of the two-probe and 
four-probe conductances in device D4, which has a twist angle of # = 1.16° +0.02°. 
Device D4 was measured in a Hall-bar configuration so that the contact resistance 
could be removed. In this particular device, neither the superlattice insulating 
states nor the half-filling states had very high impedance (probably owing to 
disorder or inhomogeneity), and so the previously described issues with four- 
probe measurements did not occur. The four-probe and two-probe measurements 
show essentially the same features, although some weak signals appear to be better 
resolved in the four-probe measurements. 

In the four-probe data, we not only observe the half-filling states (+2 electrons 

per moiré unit cell), but we also see evidence for odd-filling insulating phases at 
+3 electrons per moiré unit cell, as a weak reduction in the conductance curve. 
The existence of insulating behaviours at integer fillings of the flat bands other than 
+2 is expected in Mott-like insulators and lends further support to our claim 
that the correlated insulating behaviour originates from the on-site Coulomb 
interaction. 
Hall measurement in device D4. We also measured device D4 in a Hall configura- 
tion (transverse resistance R,,). In Extended Data Fig. 4d, e we show the low-field 
linear Hall coefficient Ry = R,,/B and the Hall density n= —1/(eRy) versus the 
gate-induced charge density n. In a uniformly gated single-carrier two-dimensional 
electronic gas, we expect that ny =n. This is indeed what we measured in the den- 
sity range —1.3x 10" cm * to 1.3 x 10'7cm “at 0.3K. However, near the half-filling 
states n= +n,/2, the Hall density jumps abruptly from ny =n to a small value close 
to zero (but without changing its sign). Beyond half-filling, ny follows ny=n+n,/2, 
a trend that is consistent with quasiparticles that are generated from the half- 
filling states. This ‘resetting’ effect of the Hall density disappears gradually as the 
temperature is raised from 0.3 K to 10K, in agreement with the energy scale of the 
Mott-like states. At higher temperatures, the Hall density is linear with n but the 
slope is no longer one, which might be related to the thermal energy kT being close 
to the bandwidth, which could result in thermally excited carriers with opposite 
polarity reducing the net Hall effect. 

In good correspondence with the quantum oscillation data shown in Fig. 3b, 
we see the behaviours of the new quasiparticles on only one side of the Mott-like 
state, for example, the side farther from the charge neutrality point; between the 
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charge neutrality point and the Mott-like state, we see an abrupt change from the 
typical large Fermi surface of the single-particle bands to a small Fermi surface of 
the new quasiparticles. This may result if the effective mass of the quasiparticles 
on one side of the Mott-like gap is considerably greater than the other side, so 
that the oscillation and Hall effect become difficult to observe very close to the 
metal-insulator transition. 
Determining the twist angle. Accurate determination of the twist angles of the 
samples is of utmost importance in understanding the magic-angle physics. We 
used several independent methods to determine the twist angle from the transport 
data. 

First, the superlattice density n,, defined as the density that is required to fill one 
band in the superlattice, is related to the twist angle by 


nt 8O 
“A 3a? 


where A is the unit-cell area and a=0.246 nm is the lattice constant of graphene. 
At approximately 1° < @< 3°, the superlattice densities +:n, are associated with a 
pair of single-particle bandgaps at their corresponding Fermi energy'****°. The 
measured density of the insulating states of the superlattice can therefore be used 
to estimate 0 directly according to equation (1). Owing to localized states, an accu- 
rate value of n, is difficult to pinpoint at zero magnetic field, and the estimated 0 
has an uncertainty of about 0.1°-0.2°. In Extended Data Fig. 7a—d we show the 
resistivity (resistance for magic-angle device D1) for four different TBG samples 
with twist angles of @= 1.38°, 1.08°, 0.75° and 0.65°. At = 1.38° and 1.08°, the 
positions of the superlattice gaps provide an good estimate of #. However, it has 
been noted!” that the apparent resistance peaks in the transport data may not 
correspond to n, but instead to 2n,, when the twist angle is below about 0.9°-1°. 
We observed a similar phenomenon when the twist angle was as small as 0.65°. 
This complicates the determination of twist angles, because of the ambiguity of 
whether the feature observed corresponds to n, or 2n,, which can result in the twist 
angle being wrong by a factor of ./2. 

Second, we use the fact that each band edge of the mini-band structure has 
its own Landau levels!*343°, In Extended Data Fig. 7e we show the magneto- 
conductance data of device D1 (first derivative with respect to n). The Landau 
levels emanating from n,= (2.7 +0.1) x 10'2cm~? can be clearly seen, which trans- 
lates to @= 1.08° + 0.02° according to equation (1). Because the intersection points 
of the Landau levels can be determined relatively accurately (uncertainty of about 
1 x 10'!cm~’), the twist angle can be determined with an uncertainty of about 
0.02° near the first magic angle. 

Third, the effect of applying strong magnetic fields such that the magnetic 
length becomes comparable with the unit-cell size is described by Hofstadter’s 
butterfly model?”. In density space, this model is better captured by Wannier*®. 
In the Wannier diagram, the Landau levels are universally represented by 
n/ns= volo +s, where ¢ is the magnetic flux through a unit cell, v is an integer, 
and s=0 labels the main Landau fan, s = +1 is the first satellite fan, and so on. 
Adjacent Landau fans intersect when ¢/¢9 = 1/q or, equivalently, 1/B = qA/@o, 
where q is another integer. Therefore, in the experiments we expect to see Landau- 
level crossings at periodic intervals of 1/B, with the periodicity proportional to the 
unit-cell area A. This effect has been observed in other two-dimensional superla- 
ttice systems and can be used to cross-check the twist angles extracted from other 
methods*~*. In Extended Data Fig. 7f we show the magneto-transport data (first 
derivative with respect to n) of device D3 at high doping densities, plotted versus 
nand 1/B. A periodic crossing of Landau levels is observed near —9 x 10'*cm~* 
with period 0.033 + 0.001 T-!, which gives A = (1.37 + 0.04) x 10° cm? and 
@=1.12°+ 0.01°, compared to 6= 1.12°+0.02° extracted using the previous 
method (n,=(2.9+0.1) x 10'2cm7). 
Estimating the Fermi velocity from capacitance data. The measured capacitance 
is the series sum of the geometric capacitance Cyeom and the quantum capacitance 
C,. The latter is directly proportional to the DOS in TBG. Therefore, by analysing 
the quantum capacitance C, as a function of carrier density n, we can extract the 
dependence of DOS on n and subsequently deduce the Fermi velocity. 

In the zero-temperature limit, the quantum capacitance is related to the DOS 
D(E) by Cy= e’D(Ex), where Ey is the Fermi energy. In a model system for TBG 
near charge neutrality that consists of massless Dirac fermions with Fermi velocity 
vp and eight-fold degeneracy (spin, valley, layer), the DOS is*”*! 


(1) 


4 
D(Ex) = —Ep(hvy)? 
T 
Because Ep =hiveky (where kg is the Fermi wavevector) is related to the density n by 


RE 
n=8 a 22 = Er=hve, a 
(2) Tw (hvp) y| 2 
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where the factor of 8 comes from the spin, valley and layer degeneracy, the 
quantum capacitance of the TBG is 


2/2 
Cq= e? | In| + 2 
q=e Tin |n| +n (2) 


Owing to disorder, the spatially averaged DOS at the Dirac point (n = Ep = 0) 
will not be absolutely zero. Therefore, a phenomenological disorder density of 
na 1 x 10!°cm~? is added in the above expression”. 

The measured capacitance is then 


1 1 1 
a ee 4+ — 
amy cE on 


(3) 


In Extended Data Fig. 2b, we show the measured capacitance near the Dirac point 
and fitting curves according to equations (2) and (3). Cgeom is approximated by the 
d.c. gating capacitance C, + 7.5 fF. We find that using vp=0.15 x 10°m s~! and 
na=1 x 10!°cm ~? gives a reasonable fit to the data measured at both 0.3K and 2K. 
The fitting for vg is sensitive to the value used for Cyeom. For example, using a 
Cyeom Value 30% larger than the value that we used above, we find a Fermi velocity 
of vp =0.10 x 10°ms~ 1. Similarly, using a value 15% smaller than the said value 
we find vg=0.20 x 10°m s~!. Nonetheless, the analysis presented here suffices to 
demonstrate that the Fermi velocity is indeed reduced greatly in the capacitance 
device D2. The slightly larger Fermi velocity compared to that measured in the 
transport device D1 (vp=0.04 x 10°m s"!) can be attributed to the slightly larger 
twist angle of device D2 (@=1.10°), which might be farther from the first magic 
angle ae = 1.05°. 
Error bars. The error bars in the inset of Fig. 2a are computed using the following 
criteria: for the transport devices D1, D3 and D4, the endpoints of the error bars 
correspond to the points at which the conductance rises to 10% of the peak value 
on that side; for the capacitance device D2, because the peaks are very sharp (see 
Fig. 3a), the error bar corresponds to the width of the entire peak in the loss tangent 
data. 
Quantum oscillations and extracting m*. We performed magneto-transport 
measurements in device D1 from 0.3 K to 10K. At each gate voltage, a polynomial 
background of resistance in B was first removed, and then the oscillation 
frequency and the effective mass was analysed. Examples of the Shubnikov—de 
Haas oscillations and their temperature dependences at a few representative gate 
voltages are shown in Extended Data Fig. 3a—c. The temperature dependence of 
the most prominent peak is fitted with the Lifshitz-Kosevich formula applied to 
the resistance: 


_ 2n?kTm* 
heB 


AR«x —* , xX 
sinh(y) 


(4) 


where AR is the change in resistance and the cyclotron mass m” is extracted from 
the fitting (examples shown in Extended Data Fig. 3d). Within the flat bands, the 
quantum oscillations universally disappear at around 10 K except very close to the 
Dirac point, consistent with the large electron mass and greatly reduced Fermi 
velocity near the first magic angle. 

The full magneto-conductance map measured in device D1 at 0.3 K is shown 

in Extended Data Fig. 5c. At first glance, it may seem that the Landau levels that 
emanate from the Dirac point ‘penetrate’ the half-filling states and continue 
towards the band edges. However, this is not the case. In Extended Data Fig. 3e, f 
we show the same data but plotted versus 1/B instead of B. Here it can be seen that 
at densities beyond the half-filling states, the oscillations clearly do not converge 
at the Dirac point, instead converging at the half-filling states. The oscillation 
frequencies extracted from these data are plotted in Fig. 3b. 
Band structure of TBG near magic angles. The general evolution of the 
band structure of TBG above the first magic angle has been described 
previously®!4-1183435, The low-energy band structure consists of two Dirac cones 
(each is four-fold-degenerate owing to valley and spin), with a renormalized Fermi 
velocity of 


1— 3a? 


v,(9) = v 
B= Nee 


where «= w/(fvokg) is the dimensionless interlayer hopping amplitude (w and vo 
are the interlayer hopping energy and original Fermi velocity in graphene, kg Gx@ 
is the interlayer momentum difference and Gx is the wave number at the corner 
of the Brillouin zone in graphene)*!°. When a < 1, vp(0) is approximately 
vo(1 — 907). ve(@) passes through zero ata = 1 / 3, which defines the first magic 
angle ee However, to our knowledge, the detailed evolution of the band struc- 
ture near magic angles has not been addressed previously. Specifically, we seek to 


determine how the associated winding number evolves as the Fermi velocity at the 
Dirac points changes sign. Close to a generic Dirac point, the effective two-band 
Hamiltonian can be written as” 


hvy(0)kt + O(k?) 


O(k?) 
H(k) = hyv,(0)o0 .k + O(k?) = O(k?) 


~ lav p(O)k + O(K2) 


in which k= (k,, ky), k=k,+ ik, and o = (0, ay) is the vector of the Pauli matrices. 
As vp(@) — 0 near the first magic angle, the terms that are linear in k vanish and 
the dispersion is dominated by the next-leading-order, k* terms. A simple form 
for the k* term is 


he? 2 


0 hv,(0)k* + — 
2m 


H(k) = (5) 


2 
hv (Ok + (kt? 0 
2m 


in which m is a parameter with the dimension of mass. This Hamiltonian describes 
the low-energy band dispersion of monolayer graphene with third-nearest- 
neighbour hopping and of bilayer graphene with Bernal stacking and trigonal 
warping*?~“®. The eigenvalues of this Hamiltonian are 


2 
+ 


2 


(6) 
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Es(k)=+, | 


The evolution of the dispersion described by equation (6) with varying vp and 
a constant m= 0.5 is shown in Extended Data Fig. la-f. The winding number 
associated with a Dirac point is defined by 
w= & (k\Vilk)ak 
T 
c 


where C is a loop around the Dirac point. The winding number follows a conser- 
vation law when the motion and merging of Dirac points are considered”. The 
winding number of the upper band (E,(k)) at each touching point in the upper 
band is labelled in Extended Data Fig. la-f. 

As vp — 0 there exist three additional Dirac points with opposite winding 
number (—1) to the main Dirac point (+1). Therefore, at vp = 0, when all four 
Dirac points merge, the winding number is —2 because the total winding number 
cannot change. 

The simple Hamiltonian form in equation (5) is an educated guess. We performed 
numerical calculations of the winding number using the continuum model for 
TBG*!> and the numerical method in ref. 47. The results are summarized in 
Extended Data Fig. 1g-]. We find that near the first magic angle of the model used, 
Se = 1.064°, the behaviour described in Extended Data Fig. la-fis exactly what 
happens at each corner of the mini Brillouin zone. The complication that arises 
when we consider the entire mini Brillouin zone is that, for a given valley (of the 
original graphene Brillouin zone, such as K), the two inequivalent corners of the 
mini Brillouin zone have the same winding number because they are the hybridized 
result of the same valley (K) of opposite layers (see Fig. 1d). Global time-reversal 
symmetry is preserved by mapping to the other valley (K’). Therefore, for a given 
valley K, when the twist angle is reduced from large angles, at which the winding 
numbers of the two corners are (+1, +1), to the first magic angle, at which the 
winding numbers are (—2, —2), a net winding number change of Aw=6 occurs 
between the two lowest-energy bands. Further theoretical work is necessary to elu- 
cidate the physics behind this winding number evolution near the first magic angle. 

In summary, we show that at exactly the first magic angle the Dirac point at each 
corner of the mini Brillouin zone (K, and K’,) becomes a parabolic band touching 
with a winding number of —2, similarly to bilayer graphene with Bernal stacking 
except that the two corners have the same winding number. The calculation that 
corresponds to the first magic angle in Extended Data Fig. li can be fitted to a 
paraboloid, which yields an effective mass of 1.1me. This value can be viewed as 
the asymptotic limit of the effective mass near the charge neutrality point as vp — 0. 
DOS in magic-angle TBG. Despite our simplistic representation of the DOS in the 
flat bands of magic-angle TBG (Fig. 4d-f), the actual single-particle DOS profile 
of magic-angle TBG is more complex, with multiple van Hove singularities. In 
Extended Data Fig. 6 we show the DOS versus energy calculated using a continuum 
model? for 6 = 1.08°. 

Data availability. The data that support the findings of this study are available 
from the corresponding author on reasonable request. 
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Extended Data Figure 1 | Evolution of the low-energy band structure of 
TBG near the magic angle. a-f, E, dispersion as in equation (6) for 
different vp and fixed m= 0.5. The k, and k, range in the figures is [—2, 2] 
and the colour scale (on the right side of the figures) for the dimensionless 
energy axis is 0 to 10 from bottom to top. The associated winding number 
of each touching point is labelled. gl, The evolution of the low-energy 
band structure of TBG near the first magic angle ane = 1.064° in the 
model. The colour shows the hotspots of the Berry curvature at the 


ve=1.5, m=0.5 


Kv vie 
k,/Ko 


k,/Ke 


Energy 
Energy 


IN 


k,/Ke 
touching point of each band. The energy axis spans an extremely small 
range of [—50, 50] jteV. The momentum axes are measured by ky = KO and 
the range for both k,/Kg and k,/Kg is [—0.1, 0.1]. The centre of the 
momentum space is the K, point of the mini Brillouin zone (see Fig. 1d), 
and the thick lines denotes the K,-M,-K’, directions (there are three 
inequivalent ones). All results are shown for the K-valley continuum 
description of TBG®. 
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Extended Data Figure 2 | Capacitance measurement set-up and the reference capacitance. All connections into and out of the cryostat are 
extraction of the Fermi velocity. a, Schematic of the low-temperature made with coaxial cables. b, Capacitance AC of device D2 near the charge 
capacitance bridge. The X and Y outputs from the lock-in amplifier refer neutrality point, and fitting curves according to equations (4) and (5) with 
to the in-phase and out-of-phase components, respectively. C(device) and different Fermi velocities. vy = 10°m s~! is the Fermi velocity in pristine 
Reeries are the capacitance and resistance of the sample. V, is the d.c. gate graphene. 


voltage, Vq is the excitation voltage, Vier is the reference voltage and Cyer is 
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Extended Data Figure 3 | Quantum oscillations and extraction 

of the effective mass. a—c, Temperature-dependent magneto-resistance 
AR of device D1 at gate-voltage-induced carrier densities of n = 

—2.08 x 10'2cm~? (a), n= —1.00 x 10!2cm~? (b) and n=0.19 x 10! cm~? 
(c). The temperatures are, from dark to bright, 0.3 K, 1.7 K, 4.2 K and 

10.7 K. d, Oscillation amplitudes of the most prominent peaks in a-c. 

The curves are fitted according to the Lifshitz—-Kosevich formula 


-0.03 


n(10'2cm”) 


(equation (4)). e, Magneto-conductance G of device D1 (measured at 
0.3 K) plotted versus n and 1/B. f, The same data with a polynomial 
background in B removed for each density. The green boxes denote 

the range of densities for the half-filling states. At densities beyond the 
half-filling states, the oscillations do not converge at the Dirac point, but 
instead at the half-filling states. 
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Extended Data Figure 4 | Supplementary transport data in devices D3 
and D4. a, Magneto-conductance G in device D3 (9 = 1.12°) versus n and 
B. The primary features at the superlattice gaps +n, and the half-filling 
states +:n,/2 are essentially identical to those for device D1. b, c, Four- 
probe (b; G,,) and two-probe (c; G2) conductance measured in device 

D4 (0 = 1.16°) at 0.3 K. The coloured vertical bars and the corresponding 
numbers indicate the associated integer filling inside each unit cell of the 
moiré pattern. As well as the half-filling states (+2), we also observe weak 
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drops in the four-probe conductivity that point to three-quarter-filling 
states at +3. d, e, Hall measurement in device D4 at various temperatures: 
the Hall coefficients Ry (d) and the Hall density ny = —1/(eRy) (e). The 
coloured vertical bars and the corresponding numbers are as in b and c. 
The x axis is the gate-induced total charge density n, whereas the Hall 
density ny and its sign indicate the number density and characteristic 
(electron-like or hole-like) curve of the carriers being transported. 
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Extended Data Figure 5 | Supplementary transport data in device D1. 
a, Temperature dependence of the conductance G of device D1 from 0.3K 
to 300K. b, The conductance versus temperature at five characteristic 
carrier densities, labelled A+ (superlattice gaps), B+ (above and below 
the Dirac point) and D (the Dirac point) in a. The arrow denotes the 
temperature above which the conductances at B; merge with that of D. 
The solid lines accompanying the A, traces are Arrhenius fits to the data. 
The thermal activation gaps of the superlattice insulating states at A, can 
be obtained by fitting the temperature dependence of the conductance at 
these densities. See ref. 13 for a detailed discussion about the superlattice 
gaps in non-magic-angle devices. The fit to the Arrhenius formula 
exp[—A/(2kT)] yields A_ =32 meV for the A_ gap and A, = 40 meV for 
the A, gap. For comparison, the same gaps measured in 0 = 1.8° TBG are 


slightly larger, A_ =50 meV and A, =60 meV for the gaps at negative and 
positive densities, respectively’*. c, Magneto-conductance in device D1 as 
a function of gate-induced charge density n and perpendicular magnetic 
field B. d, Magneto-conductance in device D1 measured as a function of 
nand in-plane magnetic field Bj. The in-plane measurement is made at a 
higher temperature of about 2 K. Combined with the degradation of the 
sample quality that resulted from the thermal cycling that was necessary to 
change the field orientation, the half-filling states are not as well developed 
as in the previous measurements. However, the gradual suppression of the 
half-filling states is still unambiguously observed when Bj is above about 
6T, slightly higher but similar to the approximately 4—6-T threshold for 
the perpendicular field (see c and Fig. 4a, b). The red and blue arrows 
point to the p-side and n-side half-filling states, respectively. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


200 


DOS (eV ‘nm”) 
3S 
oO 


DOS (eV nm?) 


' 
1 
i 
i] 
i] 
i] 
i] 
1 
i] 
I 
i] 
1 
1 
i] 
I 
i] 
i} 
1 
1 
! 
1 


LETTER 


E (meV) 


0 


Extended Data Figure 6 | DOS in magic-angle TBG. a, b, Single-particle DOS in TBG at @ = 1.08°, on linear (a) and logarithmic (b) scales. The red 
dashed lines denote the energy at which the lower and upper flat bands are half-filled. The results are obtained numerically using a continuum model’. 
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Extended Data Figure 7 | Determining the twist angle. a—d, Resistivity 
Pxx (resistance R for the 6 = 1.08° device) measurements for four samples 
with different twist angles: 0 = 1.38° (a), 0=1.08° (b), 0=0.75° (c) and 
0=0.65° (d). The filled arrows highlight superlattice features at --n, and 
open arrows highlight +2n, features that may correspond to features 
reported in ref. 17. So far, we have observed the half-filling states only in 
devices that have twist angles within 0.1° of the first magic angle. 
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device D1 (8 = 1.08°) measured at 4K. The dashed lines label the main 
(green) and satellite (blue) Landau fans. From the convergence point of the 
blue fans, we can accurately determine the superlattice density n, and thus 
0, with an uncertainty of about 0.02°. f, Hofstadter’s oscillation manifested 
as periodic crossings of Landau levels in 1/B. Data shown is the magneto- 
conductance (derivative with respect to n; dG/dn) of device D3 (9=1.12°). 
The horizontal lines have a uniform spacing of 0.033 + 0.001 T~!, which 
corresponds to = 1.12°+0.01°. 
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Room-temperature nine-,zm-wavelength photo- 
detectors and GHz-frequency heterodyne receivers 


Daniele Palaferri!, Yanko Todorov!, Azzurra Bigioli!, Alireza Mottaghizadeh!, Djamal Gacemi!, Allegra Calabrese, 
Angela Vasanelli!, Lianhe Li?, A. Giles Davies’, Edmund H. Linfield’, Filippos Kapsalidis®, Mattias Beck*, Jérome Faist? & 


Carlo Sirtori! 


Room-temperature operation is essential for any optoelectronics 
technology that aims to provide low-cost, compact systems for 
widespread applications. A recent technological advance in this 
direction is bolometric detection for thermal imaging!, which 
has achieved relatively high sensitivity and video rates (about 
60 hertz) at room temperature. However, owing to thermally 
induced dark current, room-temperature operation is still a 
great challenge for semiconductor photodetectors targeting the 
wavelength band between 8 and 12 micrometres’, and all relevant 
applications, such as imaging, environmental remote sensing and 
laser-based free-space communication**, have been realized at low 
temperatures. For these devices, high sensitivity and high speed 
have never been compatible with high-temperature operation®’. 
Here we show that a long-wavelength (nine micrometres) infrared 
quantum-well photodetector® fabricated from a metamaterial 
made of sub-wavelength metallic resonators” |” exhibits strongly 
enhanced performance with respect to the state of the art up to 
room temperature. This occurs because the photonic collection 
area of each resonator is much larger than its electrical area, thus 
substantially reducing the dark current of the device!*. Furthermore, 
we show that our photonic architecture overcomes intrinsic 
limitations of the material, such as the drop of the electronic drift 
velocity with temperature'*’*, which constrains conventional 
geometries at cryogenic operation®. Finally, the reduced physical 
area of the device and its increased responsivity allow us to take 
advantage of the intrinsic high-frequency response of the quantum 
detector’ at room temperature. By mixing the frequencies of two 
quantum-cascade lasers'® on the detector, which acts as a heterodyne 
receiver, we have measured a high-frequency signal, above four 
gigahertz (GHz). Therefore, these wide-band uncooled detectors 
could benefit technologies such as high-speed (gigabits per second) 
multichannel coherent data transfer!” and high-precision molecular 
spectroscopy'®. 

An unexploited intrinsic property of inter-subband quantum-well 
infrared photodetectors (QWIPs) based on group 11I-v semiconductor 
materials is the very short lifetime of their excited carriers. The typical 
lifetime is of the order of a few picoseconds’, which has two important 
consequences: the detector frequency response can reach 100 GHz 
and its saturation intensity is very high’? (10? W cm ~*). These prop- 
erties are ideal for a heterodyne detection scheme in which a power- 
ful local oscillator can drive a strong photocurrent (higher than the 
dark current of the detector) that can coherently mix with a signal 
shifted in frequency with respect to the local oscillator. Notably, these 
properties are unobtainable in infrared inter-band detectors based on 
mercury-cadmium-telluride (MCT) alloys, which have a much longer 
carrier lifetime and therefore a lower-speed response*””!. However, 
the performance of all photonic detectors is limited by their high dark 
current, which originates from thermal emission of electrons from 
the wells and rises exponentially with temperature, thus requiring 


cryogenic operation (about 80K) for high-sensitivity measurements. 
Highly doped” (about 10’? cm~’), photovoltaic? 10-j1m-wavelength 
QWIPs and quantum cascade detectors“ with a large number of quan- 
tum wells have been observed to operate up to room temperature, 
but only when illuminated with powerful sources, such as CO, or 
free-electron lasers. 

In this work, we show that this intrinsic limitation in QWIP detectors 
can be overcome through the use of a photonic metamaterial. We are 
able to calibrate our detector at room temperature using a blackbody 
emitting energy of only hundreds of nanowatts, orders of magnitude 
smaller than that required previously. Until now, room-temperature 
performance comparable with that reported here has only been 
demonstrated in the 3-5 1m wavelength range, using quantum cascade 
detectors”*° and standard MCT detectors”’. 

The photonic metamaterial structure of our detectors is shown in 
Fig. la. The GaAs/AlGaAs QWIP® contains Ngw=5 quantum wells 
absorbing at a wavelength of 8.9|1m (139 meV) and has been designed 
according to an optimized bound-to-continuum structure described 
in ref. 7. The absorbing region is inserted in an array of double-metal 
patch resonators” !*, which provide sub-wavelength electric field con- 
finement and act as antennas. The resonance wavelength is defined by 
the patch size s according to \=2sneg, where neg= 3.3 is the effective 
index®. The structures with s= 1.3|1.m are therefore in resonance with 
the peak responsivity of the detector. 

In our structure, the microcavity increases the responsivity of 
the device by enhancing the local field in the thin semiconductor 
absorber!”. In addition, the antenna effect extends the photon collec- 
tion area of the detector, A.o, making it much larger than the electrical 
area o=s" of the device!’. Because the detector photocurrent is pro- 
portional to A,o, whereas the dark current is proportional to a, for the 
same number of collected photons there is a substantial reduction of 
the dark current with respect to the case 7 =A.oy, which results in a net 
increase of the operating temperature of the detector. 

Besides the collection area A.oy, which defines the absorption 
cross-section per patch resonator, another crucial parameter is the con- 
trast C of the reflectivity resonance (Fig. 1b). This parameter quantifies 
the fraction of the incident photon flux that is absorbed collectively 
by the array. As shown in Fig. 1c, the contrast can be adjusted!” by 
changing the array periodicity p. Optimal detector responsivity is 
obtained at the critical coupling point, C= 1, where all incident radia- 
tion is coupled into the array. The collection area per patch is related to 
the contrast according to the expression Ao = Cp’&, where the factor 
€=0.7 takes into account the polarizing effect of the connecting wires 
(see Methods)!’. From the data in Fig. 1c, critical coupling is obtained 
with a period p = 3.3 1m, which corresponds to a collection area of 
Aco = 7-5 um”, four times larger than the electrical area, o = 1.7 um’, 
of the patch. 

The device fabrication process was optimized to generate 
current solely under the square metallic patches, and not below the 


Laboratoire Matériaux et Phénoménes Quantiques, Université Paris Diderot, Sorbonne Paris Cité, CNRS-UMS 7162, 75013 Paris, France. 2School of Electronic and Electrical Engineering, 
University of Leeds, LS2 9JT Leeds, UK. 3ETH Zurich, Institute of Quantum Electronics, Auguste-Piccard-Hof 1, 8093 Zurich, Switzerland. 


5 APRIL 2018 | VOL 556 | NATURE | 85 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


1.0 -— 


s 
°o 
° 


a a ae Figure 1 | Device concept. a, Double-metal 
o.38b ; | patch antenna, with the various metallic layers 
Photon collection area Pel . a6: | employed for electrical contacts (see Methods). 
Aco , a The absorbing region contains a 386-nm-thick 
_ 5 Of ir |  QWIP structure with five quantum wells doped 
= ol o2+ 4 — with Siat a concentration of n=7 x 10!! cm™. In 
3 0.0 this metamaterial structure, the photon collection 
2 12+ + area, Acou, is much larger than the electrical area, 
@ 0.47 10b es | a.b, Reflectivity spectrum (blue curve) of a patch 
< 8b © 4 antenna array with patch size s= 1.30pm 
oe. 6b ° 4 and period p=3.30m. The dashed line is a 
: 4b | Lorentzian fit used to obtain the absorption 
2 be nnbaneennc 7m | contrast C. ¢, d, Contrast C (c) and collection area 
0.0 Qi bsceaatt ererer Acou (d) as a function of the unit-cell area © = p” 
50 100 150 200 250 10 100 


Photon energy (meV) 


150-nm-wide leads connecting them. To this end, we realized ohmic 
contacts between the patches and the semiconductor layers under- 
neath them using an annealed PdGeTiAu alloy, whereas a Schottky 
barrier, made by depositing TiAu on the semiconductor surface, 
prevents the generation of vertical currents between the metallic wire 
and the semiconductor. Moreover, all cavities were connected with an 
external wire-bonding pad insulated by an 800-nm-thick Si3N, layer 
(see Methods). Thanks to all these precautions, the conductive area was 
reduced to the sum of the areas of the patch resonators, preventing the 
flow of additional dark current across the device. 

To quantify the detector performance, we compared the detector 
array with a reference device, called ‘mesa, with the same absorbing 
region processed into a 200-jsm-diameter circle and with 
light coupling through a 45° polished substrate edge’. This 
comparison revealed the intrinsic photoresponse of the detector (see 
Methods). In Fig. 2a we compare the peak responsivities for the two 
configurations, obtained with a calibrated blackbody source at 
1,000°C (see Methods). The mesa device could be characterized 
only up to 150K because the photocurrent becomes undetectable 
at higher temperatures. The array detectors showed a sevenfold 
enhancement of the responsivity at low temperatures compared with 
the mesa device and could be characterized up to room temperature, 
where their responsivity (0.2 A W_!) was comparable with the best 
responsivity of the mesa device, measured at around 50 K. We were 
thus able to record photocurrent spectra up to room temperature 
(Fig. 2b) which was, to our knowledge, the first such measurement 
with a QWIP operating in the 9-|1m-wavelength band using a thermal 
source. 

By quantifying the number of photons absorbed in each configuration 
(see Methods), we were able to extract the photoconductive gain, g, 


3.0 F 


of the array. The observed saturation of A,oy is in 


Unit-cell area, 2: agreement with theoretical predictions”*. 


for each structure (Fig. 2c). This gain represents the number of electrons 
circulating per photon absorbed in the quantum wells””* and is an 
intrinsic property of the absorbing region. All our devices (patch 
antennae nearly resonant with the detector absorption and the mesa 
device; see Methods) show the same gain as a function of temperature, 
irrespective of their fabrication geometry, which proves that the material 
properties are identical for all structures. The photoconductive gain 
is proportional to the electron drift velocity in AlGaAs barriers’ 
and its temperature dependence is linked to microscopic scattering 
processes in polar materials'*!°. Our results agree with the temperature 
dependence of the drift velocity described in ref. 14. The derived 
low-temperature value of the drift velocity is about 6 x 10°cm s~!, as 
expected in an electric field of 20kV cm! for an Al concentration of 
20%-30%". These results account for the drop in the responsivity with 
increasing temperature observed in Fig. 2a. Above 200K, the gain g 
has an almost constant value of 0.2-0.25, of the order of 1/Ngw. This 
implies that photoexcited electrons can travel only from one well to an 
adjacent one, as the mean free path of the electrons is now shorter than 
the distance between two wells. Interestingly, in this transport regime, 
a detector based on a single quantum well would be advantageous 
at high temperatures. These results illustrate how our device gives 
access to the high-temperature physics of quantum detectors, a regime 
unexplored so far. 

The detector performance is best evaluated through the specific 
detectivity’ D* =R./Aaa /./4egl , where R is the responsivity, A get is 
the area of the detector, e is the electron charge and J is the photocur- 
rent, which is plotted in Fig. 3a for the mesa reference and for the patch 
devices. The experimental results are compared with those of our 
model, which describes the influence of the photonic design on the 
specific detectivity as a function of the temperature’. In Fig. 3b we 
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Figure 2 | Detector characterization. a, Peak responsivity of QWIP 
devices, measured with a calibrated 1,000 °C blackbody source. The 
devices were fabricated in the geometries of the 200-j1m-diameter mesa 
reference device (circles) and resonator arrays with patch sizes s= 1.35 1m 
(squares) and s = 1.301m (triangles). b, Photocurrent spectra of the 
1.30-|1m-patch array at 78 K, 200 K and 295 K, normalized to unity. 
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A constant offset of unity has been applied for clarity. c, Photoconductive 
gain and electronic drift velocity of the three devices presented in a as a 
function of temperature, for 0.5 V bias voltage (electric field, 21 kV cm~!), 
The drift velocity is obtained using a quantum-well capture time of 5 ps 
(see ref. 7 and Methods). The data~symbol correspondence is the same 

as ina. Error bars represent a systematic error from our calibration set-up. 
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Figure 3 | Specific detectivity as a function of the temperature. 

a, Specific detectivity D" (27 field of view) as a function of the temperature 
at a bias of 0.5 V for the mesa reference (circles) and two array structures 
with s= 1.30 1m (triangles) and s = 1.35 1m (squares). The red line is a fit 
of the reference data using D°(T) = do/[1-+d) Texp(—Eact/kgT)]!”, where T 
is the temperature, do and d, are fitting parameters, E..¢= 120 meV is the 
activation energy and kg is the Boltzmann constant. The blue curve 
represents the model of quantum detectors embedded in patch resonators 
described in ref. 13. b, Ratio of the specific detectivities of the cavity array 
with s= 1.3 1m and the reference mesa. Dots show the corresponding 
background-limited infrared performance BLIP temperatures: Thr 1p = 70K 
(mesa) and T9117 = 83K (patch cavity arrays). 


provide the ratio between the specific detectivities of the two configu- 
rations. At low temperature, we observe an enhancement by a factor of 
only two. Here, the dark current is negligible and the main source of 
noise is the background photocurrent induced by the 300 K blackbody 
of the environment. In this regime, higher responsivity means also 
higher background noise, and the specific detectivity enhancement 
scales with the square root of the responsivity ratio (Rarray/Rmesa) 276, 
The situation is totally different at high temperature, where the dark 
current is the dominant contribution to the noise. In this case, the 
specific detectivity enhancement is: 
(Retrcy/Rinea) Asaf oy” x14 (1) 
and the performance of the arrays at 300 K is equivalent to that of 
the mesa reference at 150 K. This doubling of the operation temperature 
is a considerable improvement, well beyond that predicted from the 
low-temperature performance of the device. This is due to both 
the enhancement of the responsivity and the strong suppression of 
the dark current owing to the antenna effect, expressed by the factor 
(Acot/o)"?, because the combination of the microcavity and the antenna 
effect slows down the decrease of the specific detectivity with temper- 
ature, pushing the detector operation to much higher temperatures 
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than expected'*. By employing these concepts, we achieved high- 
temperature operation with relatively high sensitivities. 

Owing to its inherent high-frequency response and its reduced 
electrical capacitance, our device can be used as a heterodyne receiver. 
In this case, by increasing the power of the local oscillator, we can 
achieve the highest heterodyne sensitivity limited only by the detector 
absorption coefficient. This realization is depicted in Fig. 4a, where 
we show a schematic of the heterodyne arrangement that we used to 
probe our detector at room temperature. The setup consists of two 
single-mode distributed feedback (DFB) quantum-cascade lasers 
(QCLs)!° operating at \= 8.36 1m. The lasers, used as the signal 
and the local oscillator, are made collinear by a beam splitter, before 
impinging on the detector. The latter is wire-bonded with a high- 
frequency coaxial cable that is connected with a spectrum analyser. 
Each laser has a linewidth of the order of one megahertz when the 
current and temperature are stabilized. By adjusting the temperature 
of each laser, their frequencies are tuned to within a few gigahertz 
(see Methods). 

When the detector is illuminated by both lasers, a clear hetero- 
dyne signal appears on the spectrum analyser. In Fig. 4b we showa 
measurement at 1.06 GHz, with a signal-to-noise ratio of 40 dB. We 
measured heterodyne signals up to 4.2 GHz, as illustrated in Fig. 4c. 
Our bandwidth was limited by a strong impedance mismatch 
between the detector and the external circuit. In Fig. 4d we show 
the sensitivity of the heterodyne receiver at room temperature. The 
blue dots correspond to the direct-current (d.c.) saturation curve 
for the local oscillator, while the red symbols represent the hetero- 
dyne signal at 1 GHz as a function of the signal power. The solid 
blue line is a linear fit for the local oscillator saturation curve. The 
results show that the detector responds linearly up to 78 mW (about 
3.1kW cm ”) of incident power. Moreover, the linear fit intercepts the 
1-Hz integration band at a power of about 0.5 nW, in good agreement 
with the measured room-temperature specific detectivity shown in 
Fig. 3a. As can be observed from Fig. 4d, the heterodyne data are well 
fitted with a line describing a square-root dependence on the power 
(dashed red line) and can reach a signal-to-noise ratio of unity for 
an incident power of a few picowatts and an integration time of the 
order of 10 ms. These results indicate that the heterodyne technique 
could achieve a sensitivity at \=9 1m, which is unreachable by any 
other method at room temperature. We note that in our experiment 
the photocurrent induced by the local oscillator, I,o +0.5 mA, is still 
dominated by the detector dark current, Igark 3.5 mA. By increasing 
the power of the local oscillator or decreasing the temperature of 
the detector by a few tens of degrees using thermo-cooled elements, 
these detectors could reach the heterodyne detection limit, which is 
defined by their absorption efficiency”? and the relative intensity 
noise of the local oscillator”. 

We have demonstrated metamaterial photonic detectors operating at 
room temperature with high sensitivity in the 8-12 1m infrared atmos- 
pheric window. Although our detectors show lower d.c. specific detec- 
tivity than microbolometers, they have an extremely fast frequency 
response of tens of gigahertz. In addition, when installed on Peltier 
elements, the d.c. specific detectivity of our devices is comparable with 
that of uncooled microbolometers. Using a QCL as a local oscillator, we 
implemented a heterodyne detection setup and confirmed that these 
uncooled detectors can operate as coherent heterodyne receivers up to 
4.2 GHz. The heterodyne scheme has tremendous potential for sensitive 
detection in the mid- to far- infrared and could outperform all other 
competing technologies. 

Our devices could be implemented for the detection of coherent 
signals (lasers), in particular, for free-space high-data-rate trans- 
fer!’ and dual-comb spectroscopy”, which is an emerging high- 
resolution spectroscopic technique that requires high-speed detectors. 
In addition, well established applications, such as optical free-space 
communications, thermal imaging and environmental remote sensing, 
will greatly benefit from our coherent sensitive detection. Moreover, 
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our heterodyne scheme could also be used for the generation and syn- 
thesis of microwaves (up to a few hundreds of gigahertz) with good 
efficiency, of the order of a few per cent. Finally, these coherent detec- 
tors are ideal for implementation in photonic integrated circuits in 
which a local oscillator is combined with a heterodyne receiver. 
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METHODS 


QWIP fabrication. The QWIP structure is grown by molecular beam epitaxy. It 
consists of five GaAs quantum wells, each with a thickness of Lay =5.2 nm and 
n-doped across the central 4-nm region with Si at a density of Ni=1.75 x 10'8cm~3, 
providing a sheet density of n=7 x 10!!cm~*. The quantum wells are separated by 
AljsGazsAs barriers of thickness L, = 35 nm. At the top and bottom of this periodic 
structure, GaAs contact layers are grown, with thicknesses Letop = 100.0 nm 
and Le bottom = 50.0nm and doping densities Na,top = 4.0 x 10!8cm~? and 
Napottom = 3.0 x 10'8.cm~3, respectively. The double-metal structures are obtained 
by wafer-bonding on a GaAs host substrate using 500-nm-thick gold layers and 
by selectively etching down to an etch-stop AlgsGa35As layer grown below the 
bottom contact. As shown in Fig. la, the patch antennae are connected by thin, 
150-nm metallic wires fabricated using electron-beam lithography (consecutive 
alignments of lithography steps enable the fabrication of contacts from different 
metallic alloys). The final structure is obtained by inductively coupled plasma 
etching of the semiconductor region between the antennae. The Schottky barriers 
under the thin metallic wires prevent vertical dark current flow between the metal 
and the semiconductor*. The 45°-facet substrate-coupled geometry consists of a 
200-|1m-diameter circular mesa, with annealed consecutive layers of Pd, Ge, Ti and 
Au (deposited by metal evaporation) as a top contact, and annealed Ni, Ge, Au, Ni 
and Au layers as a diffused bottom contact. 

Extended Data Fig. 1 shows a scanning electron microscope image of the 

quantum detector device built on the basis of our metamaterial photonic concept. 
The pixel size of the device is 50 x 50,1m?. The external pad is connected with the 
array by 150-nm-thick wires and is insulated from the bottom ground plane by 
an 800-nm-thick Si;N, layer. The TiAu pad connects the device with the external 
circuit by wire bonding. 
Reflectivity and photocurrent analysis. Reflectivity and photocurrent spectra 
were obtained using a Bruker Vertex interferometer. Reflectivity measurements 
were performed at a 15° incident angle and at room temperature, and the incident 
light was polarized perpendicular to the 150-nm-thick connecting wires. To obtain 
the photocurrent spectra, the QWIP devices were mounted in a cryostat with 
an internal cooled metallic shield and a ZnSe optical window. The photocurrent 
and responsivity were measured using a blackbody source at 1,000°C, which was 
calibrated with an MCT detector. The source was focused onto the detector by two 
gold parabolic mirrors with f/1 and f/3 (which denote the ratio between the focal 
length and the diameter, 5.08 cm, of the parabolic mirrors), providing a typical 
field of view of 60°. The photocurrent was measured with a lock-in technique using 
an optical chopper at 1,059 Hz and a shunt resistance connected with the voltage 
input of a lock-in amplifier (Stanford Research SR1830) without any pre-amplifiers. 
Light-polarization dependence. Our structures support two fundamental modes, 
TMjo9 and TMoo, which are presented in Extended Data Fig. 2a. This figure shows 
the vertical component E, of the electric field in the plane of the resonator, as 
obtained through finite-element simulations. The electric field distribution follows 
a standing-wave pattern, with a node at the centre of the square and maxima at the 
edges. The connecting wires perturb the TMoi mode slightly, which results in a 
lower coupling efficiency for this mode. As a result, the total photoresponse of the 
antenna-coupled device has a co-sinusoidal dependence on the light polarization 
of a normally incident wave. 

In Extended Data Fig. 2b, we plot the peak value of the photocurrent for a 
structure with s= 1.30 1m as a function of the polarization of a plane wave incident 
on the array (open circles), with the 90° direction corresponding to the direction 
of the connecting wires. The angular integral of the cavity photocurrent 


peak Ipnoto(9) plotted in Extended Data Fig. 2b gives a polarization coupling 
2a 


coefficient = J Tproto(0)40 = 71%. The contrast value C of the TM 99- 
0 


polarized light is obtained from the measurement of Fig. 1b. For comparison, in 
the same graph we also plot the polarization dependence of the photoresponse 
measured for the mesa geometry (open squares). Here the 0° direction corresponds 
to the growth direction of the quantum wells, and the incident wave propagates 
normally towards the 45° polished facet. This polar plot therefore recovers the 
inter-subband selection rule, as expected’. 

Definition of the collection area. Because all incident radiation that is not 
absorbed is reflected, the contrast C provides the ratio between the incident (P,) 
and absorbed (P,) fluxes for each patch, C= P,/P;, directly. If ®; is the incident 
photon flux, then the power received by each antenna is P,= @;p” and the power 
absorbed is P, = £;A¢ou. Then, using the definition of C we obtain Aco = Cp”. As 
noted in the main text, we also add a corrective factor of array =0.7 owing to the 
polarizing effect of the wires, described in the previous section. 

Responsivity, gain and specific detectivity. In Extended Data Fig. 3a we show 
the responsivity curves as a function of voltage for the mesa and the patch cavity 
with s=1.35,1m. The decrease of the responsivity with temperature is attributed 


LETTER 


to the thermal dependence of the charge-carrier drift velocity and to an increased 
phonon-electron interaction’ (see Fig. 2c). The QWIP devices show a typical 
negative differential photoconductivity, known as the Gunn effect, which involves a 
decrease of the photocurrent as a function of voltage in electric fields with specific 
critical intensities, at which inter-valley electron scattering is induced in GaAs’. 

The responsivity of the mesa can be calculated by considering the voltage 
(V) dependent photoconductive gain g(T,V) of the active region of the detector 
and the peak inter-subband energy E>; = 143 meV, and by including many-body 
effects: 


Rmesa(E21, T, Vv) = Nigp(E21) eg(T, V) tcaAs&mesa/ E21 (2) 


where nish = 5.0% is the absorption coefficient for the five-quantum-well system 
in the 45°-facet geometry, tGaas = 0.67 is the substrate transmission coefficient at 
8.6j1m and Emesa = 0.5 is the polarization factor (only one polarization of the 
incident light is coupled with the 45° facet). Analogously to equation (2), we can 
define): 


eg(T, V)CE. 
En, 


Bigp(E21) 
Bisp(Ex1) + Qi + Qh 


array 


Rarray(E21, ag Vv) = (3) 


where Qohm=4 and Qyaq = 22 represent the ohmic and radiative dissipation of the 
double-metal cavity, respectively, obtained by reflectivity measurements. Indeed, 
the Lorentzian fit of the reflectivity resonance shown in Fig. 1b provides its full- 
width at half-maximum and the sum Qin + Q.'p and Q,aq is calculated from the 
analytical expression given in ref. 13. 

The dimensionless parameter Bj, quantifies the energy dissipation through 
inter-subband absorption and is expressed by a Lorentzian lineshape: 
Ea AD 


(4) 
4Ey (E— En)? + (Al)? /4 


Bisp(E) =f, 


where fy = NgwLqw/L = 0.067 is the filling factor of the absorbing quantum wells 
along their entire thickness L, E,; = 47.2 meV is the inter-subband plasma energy 
and [°= 15.0 meV is the full-width at half-maximum of the photoresponse of the 
mesa, obtained by a fit to the experimental data. We obtain a similar value 
Bisp = 0.07 for the two resonance cavities with s = 1.30,1m and s= 1.35 1m. The 
absorption coefficient in the antenna-coupled QWIPs is described by the branch- 
ing ratio Narray = Bish /(Bisb + Qin + om ) = 18.9%. Using equations (2) and (3) 
with the measurement data shown in Fig. 2a, we obtain very similar values for the 
photoconductive gain of the mesa and the array, as shown for the data acquired at 
0.5 V (21kV cm ') in Fig. 2c. This confirms that the absorbing regions in the two 
geometries are identical. Furthermore, the data show an exponential decrease of 
the gain as a function of temperature. According to ref. 7, the photoconductive 
gain can be defined as: 


$a. 7. (5) 


where Tcapt= 5 ps is the capture time, vq is the drift velocity, Ngw =5 is the number 
of quantum wells and L, = 40.2 nm is the length of a period in the structure. The 
thermal dependence of the gain is related directly to the drift velocity and there- 
fore to the electron mobility. According to ref. 14, we can express the temperature 
dependence of the photoconductive gain as: 


1 


T)= 
gl ) P a 2 (2 y" (6) 
8 —- exp(ELo / kT) kal 


Here E,o = 36 meV is the longitudinal optical phonon energy in GaAs, and the 
fitting parameter gp = 1.25 + 0.03 is the gain at equilibrium (without thermal 
scattering dependence), with the error obtained from the statistical variance of 
the fit. The second term in the denominator represents polar optical scattering 
(see ref. 15), where the parameter B= 24.4 + 1.6 is a dimensionless polar constant. 
The third term represents the deformation potential scattering caused by inter- 
action of carriers with acoustic phonons, and the corresponding parameter 
E,.=0.07 £0.01 meV characterizes the acoustic deformation potential. Equation (6) 
fits the experimental data very well, confirming the model. 

The photoconductive gain values obtained in this way are used to calculate the 
specific detectivity as a function of applied voltage at different temperatures, as 
illustrated in Extended Data Fig. 3b. 

Heterodyne measurement. The two beams from the QCLs are made collinear using 
£/0.5 germanium lenses and a beam splitter and then focused onto the detector 
by an f/1.5 lens and a \/4 waveplate to avoid optical feedback (Fig. 3a). The two 
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lasers are d.c. biased with a voltage supply and are mounted in two Janis cryostats 
to stabilize their temperatures using a liquid nitrogen flow. The QWIP is polarized 
by a Keithley 2450 sourcemeter and the heterodyne signal is sent to a spectrum 
analyser (Agilent E4407B) using a bias tee. In this arrangement the QWIP detector 
is at room temperature, without using a cooling system. The QCL used as the local 
oscillator is kept at a temperature of 254 K and that used for the signal at 293 K. 
With the temperature stabilized, it is possible to tune the spectral position of the 
two DFBs by slightly changing the applied d.c. current according to the tuning coef- 
ficients 3,9 =378 MHz mA! and 3s= 413 MHz mA“! (extracted from a linear 
fit to the emission frequency of the lasers as a function of temperature and bias). 

In the case of a high-power local oscillator, the noise-equivalent power of the 
heterodyne receiver can be written” as NEPhet = E2\/()7), where 7 is the absorption 
coefficient of the QWIP and 7 is the integration time (defined by the integration 
bandwidth Afas t = 1/Af). For our microcavity-array device we have a theoretical 
limit of NEPher of less than 1 aW for an integration time of T= 1 s at 300K. In the 
experiment shown in Fig. 4, the signal-to-noise ratio is still mainly limited by the 
dark current. The square-root fit of the signal-to-noise ratio can be extrapolated to 1, 
which provides NEPhet* 10 fW for an integration time of 1s (NEPhet 1 pW for 
an integration time of 10 ms), which is still four orders of magnitude higher than 
the theoretical limit. These estimations indicate that a high-power local oscillator 
could achieve sensitivities at the single-photon level at room temperature. 


Linearity and heterodyne measurement. In Extended Data Fig. 4 we show the 
spectra of the two QCLs compared with the room-temperature response of the 
QWIP with the microcavity-array configuration. The lasers are detuned from the 
maximum inter-subband absorption, resulting in a detector photoresponse that is 
half of the maximum achievable. This is an important observation because the 
responsivity and specific detectivity values that we report in Figs 2 and 3 corre- 
spond to the peak values of the detector photoresponse. The background-limited 
noise-equivalent power is defined as NEP = [Adet /D*. The detector area Adet 
corresponds to the 50 x 501m? area of the whole array, which is equal to the 
number of patches Npatch Multiplied by the unit-cell area X’= p’ of the array. Indeed, 
in the critical coupling point, all incident radiation is absorbed by the array, and 
therefore the collection area A¢oy of each patch coincides with X’. Using the 
measured specific detectivity at 295 K for the cavity with s= 1.30,1m at 0.5 V 
(Fig. 3), we have D® =2.8 x 10’cm Hz®> W~! and NEP =0.2nW Hz~°°. Taking 
into account the 50% spectral overlap, we obtain NEP=0.4nW Hz °°, 
which agrees with the value obtained from the linearity measurement shown in 
Fig. 4d. 

Data availability. All data supporting the findings of this study are available within 
the paper and the Extended Data files. 


32. Sze, S. M. & Kwok, Ng. Physics of Semiconductor Devices (Wiley, 2011). 
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Extended Data Figure 1 | Global view of the device. Scanning electron microscope image of mid-infrared QWIP structure embedded into a 50 x 501m? 
array of patch resonators. The top TiAu contact is evaporated onto an 800-nm-thick Si;N, insulating layer. 
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Extended Data Figure 2 | Polarization dependence of the 
photoresponse. a, Finite-element simulation of the E, component of the 
electric field coupled with the patch-cavity QWIP for the TMjo9 and the 
TMo1o modes. The colour scale represents the amplitude of the electric 
field and ranges from the absolute maximum (red) to the maximum 
amplitude in the opposite field direction (blue). b, Polar graph of the cavity 


Photocurrent 
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photocurrent peak as a function of the wire-grid polarization angle. The 
photocurrent is normalized by its maximum value, at 0°. The open circles 
are the results for the cavity array, where the 90° direction corresponds 

to the connecting wires. The open squares are the results for the mesa 
geometry, where the 0° direction corresponds to the growth direction of 
the quantum wells. 
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Extended Data Figure 3 | Mesa and cavity-array detector characteristics. _ is indicated for each measured curve. b, Specific detectivity for the mesa 
a, Responsivity of the mesa and the s = 1.35 1m antenna-coupled devices and the microcavity devices as a function of the applied bias at different 
as a function of applied voltage. The temperature (in kelvin) of the QWIP temperatures. 
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Extended Data Figure 4 | Spectral characteristics of the two lasers and emission lines. QCL0 was operated at 330 mA and a stable temperature 
the QWIP detector. a, Emission spectra of the QCLs (QCLyo and QCLs), of 293 K, and QCLs was operated at 280 mA and a stable temperature of 
compared with the room-temperature response of the microcavity QWIP. 254K. 


b, Magnification of the spectrum of a, showing the two distinct QCL 
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Molecular nucleation mechanisms and control 
strategies for crystal polymorph selection 


Alexander E. S. Van Driessche!, Nani Van Gerven®*®, Paul H. H. Bomans””, Rick R. M. Joosten2*, Heiner Friedrich?, 


David Gil-Carton‘, Nico A. J. M. Sommerdijk? & Mike Sleutel>° 


The formation of condensed (compacted) protein phases is 
associated with a wide range of human disorders, such as eye 
cataracts!, amyotrophic lateral sclerosis”, sickle cell anaemia’ and 
Alzheimer’s disease*. However, condensed protein phases have 
their uses: as crystals, they are harnessed by structural biologists 
to elucidate protein structures’, or are used as delivery vehicles for 
pharmaceutical applications®. The physiochemical properties of 
crystals can vary substantially between different forms or structures 
(‘polymorphs’) of the same macromolecule, and dictate their 
usability in a scientific or industrial context. To gain control over 
an emerging polymorph, one needs a molecular-level understanding 
of the pathways that lead to the various macroscopic states and of 
the mechanisms that govern pathway selection. However, it is still 
not clear how the embryonic seeds of a macromolecular phase are 
formed, or how these nuclei affect polymorph selection. Here we 
use time-resolved cryo-transmission electron microscopy to image 
the nucleation of crystals of the protein glucose isomerase, and to 
uncover at molecular resolution the nucleation pathways that lead to 
two crystalline states and one gelled state. We show that polymorph 
selection takes place at the earliest stages of structure formation 
and is based on specific building blocks for each space group. 
Moreover, we demonstrate control over the system by selectively 
forming desired polymorphs through site-directed mutagenesis, 
specifically tuning intermolecular bonding or gel seeding. Our 
results differ from the present picture of protein nucleation’ '”, in 
that we do not identify a metastable dense liquid as the precursor 
to the crystalline state. Rather, we observe nucleation events that 
are driven by oriented attachments between subcritical clusters 
that already exhibit a degree of crystallinity. These insights suggest 
ways of controlling macromolecular phase transitions, aiding 
the development of protein-based drug-delivery systems and 
macromolecular crystallography. 

How do protein crystals nucleate? What is or are the pathway(s) 
from isolated protein molecules to mesoscopic and finally macroscopic 
crystals? There have been three independent nanometre-scale obser- 
vations of protein nucleation at solid—liquid interfaces!*"!°, revealing 
both direct and indirect pathways, but these works used atomic force 
microscopy—a surface technique that is blind to events taking place 
within the liquid. In another approach, in situ liquid-cell transmission 
electron microscopy was used to map the nucleation pathways of cal- 
cium carbonate’ and, more recently, of the protein lysozyme!’ but that 
technique currently lacks the lateral resolution needed to resolve the 
structure of the nuclei and the particles that precede them. 

To obtain an experimental window onto the formation of a crystal 
nucleus in liquid at molecular resolution, we use cryo-transmission 
electron microscopy to image vitrified samples that have been plunge 
frozen at various time intervals. We study the nucleation pathways of 
glucose isomerase, a protein with applications in the biofuel and food 


industries as a crystalline suspension’*. Depending on the solution 
conditions, glucose isomerase can crystallize into at least two different 
space groups”, or (as we show here) can aggregate into a disordered, 
gelled state. Using ammonium sulfate as a precipitant, we find that 
the protein exhibits a polymorph transition from an [222 (rhombic) 
to a P2,2)2 (prismatic) space group as a function of the precipitant 
concentration (Extended Data Fig. 1). Turbidity measurements reveal 
that the induction time for nucleation decreases exponentially as the 
ammonium sulfate concentration increases from 1.2 M to 1.65M 
(Extended Data Fig. 2). However, no conditions are identified that 
lead to liquid-liquid phase separation or gelation. Cryo-transmission 
electron microscopy (cryo-TEM) imaging of the earliest quenched 
sample (plunge frozen after 20 seconds) in 1.5M ammonium sulfate 
(a mixed [222/P2,2;2 condition) shows the presence of elongated 
particle assemblies (‘nanorods’; Fig. la—e). Given the overall particle 
dimensions and the electron-microscopy silhouette of the subunits, 
we identify the building blocks of these nanorods to be single protein 
molecules (Fig. la—c). The nanorods are on average two molecules in 
width (1.7 +1; n=60) and 12 molecules in length (12.4 +5; n=60), 
with an intermolecular distance of 8.2 +0.1nm (n=51) along the long 
axis (Fig. 1d and Extended Data Fig. 3). Single-file protein chains are 
also observed (Fig. 1c), as well as trimers, tetramers and larger polymers 
at later time points (10-20 min; Fig. le). Although successive images 
show a gradual increase in the nanorod concentration as a function 
of time (Fig. 1a, b; see Extended Data Fig. 4 for the dependence on 
ammonium sulfate concentration), there is no increase in their length 
(Extended Data Fig. 3g). 

At around 15 to 30 minutes after protein-precipitant mixing, larger 
structures begin to emerge. We detect (sub)micrometre-sized fibres of 
43 +7 nm (n= 88) in width (Fig. 1f). The molecular columns that run 
along the fibre axis have a characteristic centre-to-centre distance of 
8.0+0.1nm (n=27), in line with the spacing measured for the nano- 
rods. The associated two-dimensional fast Fourier transform (2D-FFT) 
image does not show sharp diffraction spots, but rather diffraction 
arcs in one or two directions (Fig. 1f, inset). Such arcs indicate that 
there is local ordering, but also substantial deviation from the crystallo- 
graphic directions. The aspect ratio of these fibres ranges from 10 to 30, 
a considerable increase with respect to the aspect ratio of the nanorods 
(around 6), indicating that fibre broadening is slow compared with 
elongation. We also see bundles of individual fibres that are making 
loose lateral contacts with each other (Fig. 1g, h). These bundles have 
varying degrees of misalignment at the interfibre level, leading to dif- 
ferent levels of disorder. 

Within the same time frame, faceted nanocrystals start to appear, 
with morphologies and intermolecular distances that fit the P2)2)2 
and [222 space groups (Fig. 2a—e and Extended Data Table 1). The 
crystallinity of both particle types is reflected in the emergence of sharp 
diffraction spots in the 2D-FFT. The smallest observed rod-like P2,2,2 


1Université Grenoble Alpes, Université Savoie Mont Blanc, CNRS, IRD, IFSTTAR, |STerre, F-38000 Grenoble, France. #Laboratory of Materials and Interface Chemistry and Center of Multiscale 
Electron Microscopy, Department of Chemical Engineering and Chemistry, Eindhoven University of Technology, PO Box 513, 5600MB Eindhoven, The Netherlands. “Institute for Complex 
Molecular Systems, Eindhoven University of Technology, PO Box 513, 5600MB Eindhoven, The Netherlands. “Structural Biology Unit, CIC bioGUNE, Parque Tecnoldgico de Bizkaia, 48160 Derio, 
Bizkaia, Spain. ®Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium. ®Structural and Molecular Microbiology, Structural Biology Research Center, VIB, 


Pleinlaan 2, 1050 Brussels, Belgium. 


5 APRIL 2018 | VOL 556 | NATURE | 89 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


©, Nanorodimonome: 


Figure 1 | Pre-nucleation assemblies of glucose isomerase induced 
by ammonium sulfate. a, b, Cryo-TEM image of a glucose isomerase 
solution in 1.5 M ammonium sulfate, at 20 seconds (a) and 9 minutes (b) 
after mixing. The inset in panel a shows a van der Waals representation 
of a glucose isomerase molecule. The inset in panel b shows a class- 
average electron microscopic image of the nanorods’ building blocks. 
c-e, Magnified images of a nanorod monomer (c), dimer (d) and trimer 


crystal measures 380 nm by 120 nm (aspect ratio = 3), and has a width 
that exceeds those of the fibres (Fig. 2a). The characteristic interpla- 
nar spacing parallel to the long crystal axis is 8.1 + 0.1 nm (n= 26), 
again in line with the value obtained for the nanorods and fibres. The 
nanorod alignment parallel to the nearest facet of the crystallite in 
Fig. 2b suggests that oriented attachment is a mode of incorporation 
into the crystalline phase. For the rhombic [222 space group, the small- 
est crystals that we find have an edge length of +100 nm (Fig. 2d, e) and 
characteristic distances of 5nm and 7nm. 

With polyethylene glycol (PEG 99 or PEG 590) as the precipitant, 
glucose isomerase exhibits a similar polymorph transition from rhom- 
bic to prismatic crystals, albeit over a relatively narrow PEG concen- 
tration range (Extended Data Figs 1, 5). However, the highest PEG 
concentration produces a different effect to the highest ammonium 
sulfate concentration, in that glucose isomerase solidifies rapidly into 
a kinetically arrested gelled state (Extended Data Fig. 6). Cryo- TEM 
imaging of a range of PEG conditions reveals striking similarities to the 


Dosa U ost iaa pase 

(e) composed of glucose isomerase molecules, with a centre-to-centre 
distance of 8.2 + 0.1nm (m=51, marked in panel c). f-h, Emergence 

of larger structures (at 15-30 min). f, A micrometre-sized fibre, with 

the upper inset being a magnified image and the lower inset being the 
corresponding 2D-FFT image, showing diffraction arcs in two directions. 
g, h, Bundling of fibres into aggregate structures with varying degrees of 
alignment. The inset in panel g is a 2D-FFT image. 


nucleation pathways observed with ammonium sulfate. At 5% (w/v) 
PEG jo90 (an [222-only condition), we detect only mesoscale, rhom- 
bic crystals that exhibit fringe patterns compliant with the expected 
interplanar spacing of the /222 space group (Fig. 3a and Extended 
Data Table 1). Under conditions that lead to nucleation of both of the 
space groups and the gel (86 mg ml“, 4.5% (w/v) PEGi 500; Extended 
Data Fig. 5b), fibre-like structures appear 2-3 minutes after protein/ 
precipitate mixing; these structures have a characteristic intermolecu- 
lar distance of 8.0 +0.2nm (n= 24) along the long axis, and measure 
41+6nm (n= 166) in width (Fig. 3b-e). Interestingly, we observe 
no nanorods in any of the time points for this sample series, or for any 
glucose isomerase/PEG sample (Fig. 3b-e). At later time points, we 
see the grouping of these fibres into structures of increasing dimen- 
sions, exhibiting lateral stacking of individual fibres but still separated 
by a thin solvent layer (Fig. 3d, e). Identical sample replicates that 
failed to crystallize, but ended up in the gel state, reveal the presence 
of fibres that are morphologically similar to those described above 


Figure 2 | Emergence of faceted prismatic and rhombic nanocrystals 
15-30 minutes after mixing glucose isomerase and ammonium sulfate. 
a, A P2;2;2 nanocrystal. Inset, 2D-FFT image, with diffraction spots 
(measured 8.1 nm x 8.5nm, 88°; theoretical (110) plane 7.8nm x 8.1nm, 
90°). b, Larger P2)2)2 crystal displaying higher-order diffraction. c, Mature 
P2)2;2 crystal with a clear fringe pattern with a spacing of 8nm). The 
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magnified image of the facet edge resolves an unfinished molecular layer, 
suggesting that crystal growth at this point proceeds by incorporation of 
single glucose isomerase molecules. d, e, The smallest detected rhombic, 
crystalline objects display diffraction spots in two directions, that is, 
7nm and 5 nm at 90° (theoretical (101) plane 6.7 nm x 5nm; (011) plane 
7.3nm x 4.3nm). Blue arrows show contamination by ethane. 
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Figure 3 | Cryo-TEM imaging of PEG-induced crystallization of glucose 
isomerase. a, Micrometre-sized rhombic crystals obtained 4 minutes 

after mixing glucose isomerase and PEG at a low depletion attraction (in 
5% PEGj000). Lower left inset, a magnified image of the resolved crystal 
lattice. Lower right inset, the corresponding 2D-FFT image. Upper 

inset, magnification of the upper facet edge, resolving free monomers in 


(of width 44 + 5nm; n= 100), but at drastically higher concentrations 
(Fig. 3f). With higher depletion—attraction forces (through the use 
of 7% PEG 009) but lower protein concentrations (37.5mg ml-!)— 
conditions that slow down the rate of aggregation—cryoTEM imag- 
ing before kinetic arrest (6 min after protein/PEG mixing) reveals the 
formation of fractal-like aggregates with a distinct lack of rotational 
order. This can also be seen from the two concentric arcs ina 2D-FFT 
image (Fig. 3g). The FFT image is reminiscent of those of the disor- 
dered fibre bundles observed in the ammonium sulfate experiments, 
but shows higher packing density, as indicated by the 7.0 + 0.2nm 
(n = 16) spacing (Fig. 3g) and broader fibre cross-section (of width 
100 +50nm; n=20). 

The striking resemblance at the microscopic level between the 
P22;2 crystallization pathways (Fig. 1, with ammonium sulfate; or 
Fig. 3b-e, with PEG) and the gelation pathway strongly suggests that 
both phases originate from the same precursor states (that is, fibres). 
This observation prompted us to perform seeding experiments using 
glucose isomerase/PEG hydrogels, to see whether we could selectively 
elicit P22)2 crystals. For this, we transferred a glucose isomerase/ 
PEG hydrogel fragment to a freshly prepared mother liquor solution 
that leads exclusively to [222 crystals (Fig. 5a). Time-lapse imaging of 
the solution—gel interface reveals the rapid and exclusive formation 
of P2;2,2 crystals on, or protruding from, the gel phase, demonstrat- 
ing that gels can be used as polymorph-specific seeding agents. If 
the hydrogel is instead transferred to a similar 1222-exclusive con- 
dition but a lower PEG concentration (3% (w/v) PEGjs99), then the 
gel phase gradually dissolves as P22)2 crystals emerge over time 
(Fig. 5f, g). 

Both the early-stage nanorods (formed in high ammonium sulfate 
concentrations) and the later-stage fibres (formed in high ammonium 
sulfate or PEG) exhibit high aspect ratios, suggesting that there are 
substantial differences in lattice-contact strengths (|C;|) for these con- 
ditions. To understand the origins of these differences, we analyse the 
mode of intermolecular bonding within the nanorod structure and 
compare it with known crystal lattice contacts. Using the glucose 
isomerase atomic structures for both space groups, we generate nine 
plausible nanorod models (Extended Data Fig. 7). On the basis of clear 
discrepancies between the intermolecular distances in these models, 
and a comparison of the cryoTEM silhouette and the van der Waals 
model, the only plausible orientation of the nanorod from the P2;2)2 
space group is in the (001) direction (Fig. 3 and Extended Data Fig. 2). 
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solution. b-e, Time series of a glucose isomerase/PEG sample with 4.5% 
PEG}500 reveals a P2)2;2 nucleation pathway reminiscent of that observed 
with ammonium sulfate (Fig. 1), but with a clear absence of nanorods 

at all measured time points. f, g, Formation of a glucose isomerase/PEG 
hydrogel at an intermediate depletion attraction (f, with 4.5% PEG 599) 
and a high depletion attraction (g, with 7% PEG 90). 


There are two lattice contact types in the P2)22 space group—C, along 
the (001) direction, and C along the (110) direction, involving the 
formation of six and seven hydrogen bonds, respectively (Fig. 4a, b 
and Extended Data Table 2). The nanorod anisotropy suggests that 
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Figure 4 | Lattice contact analysis for both space groups of glucose 
isomerase. a, Contact types for the P2)2)2 and [222 space groups, with 
magnifications of the corresponding surface patches and hydrogen 

bonds, and showing the amino acids at the contact sites (T, threonine; 

Q, glutamine; E, glutamate; R, arginine; D, aspartate; K, lysine; G, glycine). 
Hydrogen bonds are shown with solid lines. b, Depiction of the molecular 
columns that run through a P2;22 glucose isomerase crystal, with the 
c-axis corresponding to the long axis of the nanorods observed in cryo- 
TEM. C, and C3 are contacts 1 and 2. 
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Figure 5 | Gel-mediated nucleation of P2)2)2 crystals. a, Glucose 
isomerase/PEG hydrogel formed with 86 mg ml! glucose isomerase 

(GI) and 7% (w/v) PEGisoo. b, An aliquot of the gel shown in panel a was 
transferred to a freshly prepared solution of 86 mg ml’ glucose isomerase 
and 4% (w/v) PEG 509. c-e, Consecutive magnified images of the gel- 
solution interface, at respectively 2 minutes, 17 minutes and 46 minutes 
after transfer of the gel aliquot. f, An aliquot of the gel shown in panel 

a was transferred to a freshly prepared solution of 86 mg ml’ glucose 
isomerase and 3% (w/v) PEG;509. g, Formation of P2)2)2 crystals is linked 
to gradual dissolution of the gel phase. 


|C,| is much greater than |C)|. This bond hierarchy can be rationalized 
by considering the salting-out effects induced by ammonium sulfate; 
these effects include preferential hydration, salt exclusion in the vicin- 
ity of the surface”°, and increased costs of solvent cavity formation”!. 
Sulfate ions are excluded with varying degrees from a macromolecu- 
lar surface’, with local negative charges contributing strongly to the 
preferential expulsion”®. This in turn leads to a net attraction when 
two macromolecules are close to each other, owing to an imbalance in 
the osmotic pressure**. Thus the strength of the C; contact is probably 
a direct consequence of the 16 negative charges that are buried upon 
formation, compared with the five such buried negative charges in C2. 
Such symmetry breaking will be less pronounced when the precipitant 
is PEG, which induces a more isotropic attraction, leading to rhom- 
bic nuclei, at low concentrations. On the other hand, PEG-induced 
depletion attraction is not likely to be perfectly uniform for anisotropic 
particles, as it will favour protein-protein interactions that maximize 
the overlap volume” and, by proxy, the total buried surface area upon 
complexation. The C; contact has the largest difference in accessible 
surface area (AASA), followed by C2 and C3 (Extended Data Table 2). 
We argue that this contributes to the emergence of P2,2)2 crystals in 
intermediate PEG concentrations, where the differences between the 
contacts become amplified. 

An additional level of polymorph control can be gained by means of 
site-directed mutagenesis: knowing the amino-acid composition of spe- 
cific lattice contacts allows one to tune their strength. We designed three 
classes of glucose isomerase mutant that are selectively perturbed in the 
Cy, C2 or C3 modes of interaction. We predict that mutant proteins with 
impaired C, or C, contacts will not form P2;2)2 crystals; mutants with 
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altered C3 contacts should be [222 ‘knockouts: We used crystallization 
screening of said mutants to investigate the strategy of polymorph con- 
trol by mutagenesis. As predicted, mutants with defective C; contacts 
(in these S171W mutants, the amino acid at position 171, serine, was 
mutated to tryptophan) or defective C, contacts (R387A mutants, with 
arginine 387 mutated to alanine; and GI_His mutants, in which the 
protein’s carboxy terminus was tagged with a run of histadine resi- 
dues) no longer produced P22;2 crystals in the tested conditions, but 
still nucleated into the 1222 space group (Table 1 and Extended Data 
Fig. 8a). The opposite was true of C3 mutants (R331A plus R340D, 
where D is aspartate). Seeding experiments using wild-type glucose 
isomerase microcrystals complement the results of the nucleation tri- 
als: wild-type P2;2;2 crystals exhibited no growth when transferred to 
solutions containing C; or Cz mutants; similarly, wild-type [222 crystals 
exhibited no growth in solutions of the C3 mutant (Table 1). Notably, 
cryoTEM images of $171W, R387A and GI_His crystals in high con- 
centrations of ammonium sulfate reveal the presence of amorphous 
aggregates, rather than the nanorod or fibre assemblies seen with the 
wild-type protein (Extended Data Fig. 8b). 

To summarize, when in the presence of a high concentration of 
ammonium sulfate, the glucose isomerase solution undergoes a rapid 
decomposition into nanorods that have a quaternary structure similar 
to the molecular arrangement along the c-axis of the P2,2;2 space 
group. At later time points, fibres are formed (at either high ammo- 
nium sulfate or intermediate PEG concentrations) that again have 
identical intermolecular distances to each other along their long axis. 
Such high-aspect-ratio assemblies are not observed under conditions 
that exclusively yield the 1222 polymorph, nor do they have a structure 
that is compatible with the crystal lattice of the latter. The fibres are 
therefore exclusively precursors to the prismatic P2,2;2 polymorph. 
For the ammonium sulfate pathway, our observations suggest that 
nanorods are the primary building blocks of a next-level self-assembly 
process that leads to the formation of nanorod oligomers, and subse- 
quently to fibre-like assemblies. Having said that, the data obtained 
with intermediate PEG concentrations show that fibres can also be 
formed in the absence of a nanorod phase. Fibres increase in width 
by lateral attachment, which involves the formation of a large num- 
ber of interprotein bonds—a complex process that can lead to kinetic 
traps, as shown by the disorder seen in many fibres (Fig. 1f). Thus, 
assembly size and crystallinity are order parameters that can evolve 
independently of each other. We hypothesize that local relaxation from 
a strained, more disordered state—as seen in many fibrous assem- 
blies—into the crystalline arrangement is associated with an activation 
barrier that is prohibitively large, yielding disordered fibre assemblies 
that represent a metastable trap in the protein-assembly pathway. 
Samples in low PEG concentrations show a total absence of nanorods, 
higher-order assemblies thereof, or any disordered, liquid-like phases. 
We detect only faceted crystalline objects, suggesting that [222 crystals 
follow a direct nucleation pathway with monomers as their principal 
building blocks. 

The various pathways seen during the crystallization of glucose 
isomerase reveal a mechanism of protein polymorph selection that 
takes place at the earliest measurable stages (20 seconds) of self- 
assembly (Fig. 6). The primary multimers that are formed have an 


Table 1 | Nucleation trials of glucose isomerase mutants and crystal 
growth tests using wild-type seeds 


1222 P2\2,2 
Altered 1222 P2,2)2 seed seed 
Mutant contact nucleation* nucleation growth* growtht 
S171W C, v x _ x 
R387A Co v x — x 
GI_His Co v x x x 
R331A/R340D C3 x v = 


*50 mM HEPES 7.0 buffer, 10O mM MgClo, 4% (w/v) PEGiooo- 
+50mM HEPES 7.0 buffer, 1OOmM MgClo, 1.55M ammonium sulfate. 
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Figure 6 | Proposed model for glucose isomerase crystallization. Top 
left, high concentrations of ammonium sulfate (AS) induce a strong 
anisotropic interaction between glucose isomerase molecules, which 
leads to nanorod formation. These nanorods self-assemble into either 
disordered fibre bundles or P2;2)2 crystals. Bottom left, at low PEG or 
AS concentrations, glucose isomerase follows a direct nucleation pathway 


architecture that already resembles the crystalline state. This direct 
nucleation mechanism can be attributed to the mode of interaction 
between the glucose isomerase molecules, which is a combination 
of isotropic repulsion and anisotropic attraction!+. Such interaction 
potentials affect the emergent nucleation pathway, as they disfavour 
disordered dense states**”’. Self-organization of monomers into (pre) 
critical clusters with a pronounced symmetry determines their sub- 
sequent assembly path at their points of inception”®. Most unexpect- 
edly, the rod-shaped cluster nucleation pathway for glucose isomerase 
diverges from the two-step nucleation model for proteins”? that has 
gained traction recently, but is perhaps more reminiscent of the cluster- 
cluster interaction at high supersaturation that is described by classical 
nucleation theory. 

To date, control over emerging polymorphs has been based mostly on 
detailed knowledge of phase diagrams, and has focused predominantly 
on solubility differences between polymorphs. By contrast, our insights 
into the mechanism of polymorphism could inspire selection strategies 
that are geared towards controlling the modes of interaction, including 
directionality and kinetics. By (de)stabilizing the modes of interaction 
that are specific to each polymorph, one can control the throughput 
of the various nucleation pathways, and ultimately influence the yield 
of the desired polymorph. Such an approach could aid in the develop- 
ment of hydrogel- and crystal-based biotherapeutic agents that require 
precise control over the outcome of macromolecular phase transitions. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Protein production and purification. Glucose isomerase was obtained from 
Hampton Research (from wild-type Streptomyces rubiginosus) and received as 
a crystalline slurry. Small aliquots were dialysed overnight (using Spectra/Por 
standard regenerated cellulose (RC) tubing, molecular weight cut-off (MWCO) 
12-14kDa; SpectrumLabs) against 10 mM HEPES pH 7.0 buffer plus 1 mM MgCl 
at 4°C. The protein solution was concentrated using a centrifugal filter with a 
MWCO of 100 kDa (Amicon Ultra-15 Cellulose, Milipore) to a typical concen- 
tration of 200-250 mg ml! and stored at 4°C. Concentrations were determined 
by measuring the absorbance at 280 nm using an extinction coefficient €2g9 of 
1.074mg-!mlcm7!. 

Synthetic DNA of full-length wild-type glucose isomerase (UniProt, P24300) 
with a carboxy-terminal His, tag (GI_His), and mutants ($171 W, R387A, R331A/ 
R340D) with no carboxy-terminal His, tag cloned into plasmid pET22b via Ndel 
and NcoI restriction sites, was ordered at GenScript. Recombinant proteins were 
expressed in Escherichia coli strain BL21(DE3) after induction at an optical den- 
sity (OD)600nm of 0.7 with 1 mM isopropyl 3-p-1-thiogalactopyranoside (IPTG) 
for 3h at 37°C. Cells were harvested by centrifugation at 6,238g for 15 min and 
resuspended in 100 mM Tris-HCl pH 7.3, 1 mM ethylenediaminetetra acetic acid 
(EDTA, 4ml per gram of wet cells) supplemented with 5|1M leupeptin, 1 mM 
4-(2-aminoethyl)benzenesulfonyl fluoride (AEBSF), 100j1g ml! lysozyme and 
20 pg ml"! DNase I, and incubated for 30 min at 4°C. Subsequently, MgCl, was 
added to a final concentration of 10mM and cells were lysed by two passages in 
a Constant System Cell Cracker at 20 kilopounds per square inch (kpsi) at 4°C; 
cell debris was removed by centrifugation at 48,400g for 45 min at 4°C. The cyto- 
plasmic extract was incubated for 10 min at 65°C and the insoluble fraction was 
removed by centrifugation at 48,400g for 45 min at 4°C. 

For the non-His-tagged constructs, the supernatant was filtrated through a 
0.22 um pore filter and loaded on a 5 ml pre-packed Hitrap Q FF column (GE 
Healthcare) equilibrated with buffer A (50 mM bis-tris-HCl pH 6.0, 10mM NaC)). 
The column was then washed with 40 bed volumes of 20% buffer B (50 mM bis- 
tris-HCl pH 6.0, 1 M NaCl) and bound proteins were eluted with a linear gradient 
of 20-50% buffer B over 10 bed volumes. Fractions containing wild-type or mutant 
glucose isomerase—as determined by sodium dodecyl sulfate polyacrylamide 
gel electrophoresis (SDS-PAGE)—were pooled and supplemented with ammo- 
nium sulfate to a final concentration of 1.5 M and loaded on a 5 ml pre-packed 
HiTrap Phenyl HP column (GE Healthcare) equilibrated with buffer A (100 mM 
Tris pH 7.3, 1.5M ammonium sulfate). The column was then washed with 40 
bed volumes of 25% buffer B (100 mM Tris pH 7.3) and bound proteins were 
eluted with a linear gradient of 25-85% buffer B over 15 bed volumes. Fractions 
containing wild-type or mutant glucose isomerase, as determined by SDS-PAGE, 
were pooled and dialysed (Spectra/Por standard RC tubing, MWCO 12-14kDa; 
SpectrumLabs) against 10 mM HEPES pH 7.0 plus 1 mM MgCl, overnight at 4°C 
(the buffer was replaced twice), and concentrated ina MWCO 100kDa spin con- 
centrator (Amicon Ultra-15 Cellulose; Milipore) to a typical final concentration of 
30mg ml~!. (For the $171W mutant, ¢2g9 = 1.198 mg! ml cm™!).Cleared cytoplas- 
mic extracts of GI_His were loaded on a 5 ml pre-packed Histrap Ni-NTA column 
(GE Healthcare) equilibrated with buffer A (50 mM Tris-HCl pH 7.3, 500 mM 
NaCl and 20 mM imidazole). The column was then washed with 40 bed volumes 
of buffer A, and bound proteins were eluted with a linear gradient of 0-75% buffer 
B (50 mM Tris-HCl pH 7.3, 500 mM NaCl and 500 mM imidazole) over 15 bed 
volumes. Fractions containing GI_His were pooled, dialysed and concentrated as 
indicated above. 

Glucose isomerase crystallization. For a typical crystallization experiment with 
wild-type glucose isomerase, the protein stock solution was first diluted to a con- 
centration of 75 mg ml! in 50mM HEPES pH 7.0 and 100mM MgCh, and then 
mixed at 22°C with an equal volume of a buffered ammonium sulfate, PEG; 900 
or PEG}599 solution that was at a concentration twice that desired after mixing. 
Final concentrations ranged from 0.5 M to 1.75 M ammonium sulfate, and from 
3% to 7% (w/v) PEG, 009 or PEG59. Space-group determination was based on the 
distinct crystal morphologies of both space groups (Extended Data Fig. 1a), using 
a wide-field optical microscope. The phase diagrams in Extended Data Fig. 1b, c 
were determined by setting up triplicate crystallization tests using the micro- 
batch-under-oil method (with Nunc 72 microwell minitrays; Sigma Aldrich) at 
22°C, with 10:1 drops of mother liquor. 

Precipitant dependence of glucose isomerase crystallization. We began by map- 
ping out the concentration dependence of glucose isomerase polymorphism for the 
precipitants used here. Our starting point was 50 mM HEPES pH 7.0 plus 100 mM 
MgCh,, which yields exclusively rhombic (J222) crystals for a broad range of protein 
concentrations (20-75 mg ml~!). By supplementing that condition with ammo- 
nium sulfate in 100-200 mM increments, from 0.5 M to an upper limit of 1.75 M 
final concentration, we saw a gradual shift from rhombic to prismatic (P2122) 
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crystals (Extended Data Fig. 1b). Similarly, by supplementing the base condition 
with either PEG, 999 or PEG;s99 to a final concentration of 3% to 8.5% (w/v), we 
recorded a gradual shift from [222 to P2,2)2 crystals; at the higher PEG concen- 
trations, a dense gel phase was formed. We note that there was a narrow PEG 
concentration range (for PEGjs00, from 4.5% to 5.5%) where [222 crystals, P2;2\2 
crystals and gels were observed simulataneously (see Extended Data Fig. 5 for a 
detailed microscopic record). Gelation depended only weakly on glucose isomerase 
concentration: the gelation line occurs as a vertical in Extended Data Fig. 1. 
Induction-time measurements. We determined induction times (ting) for 
glucose isomerase crystallization as a function of ammonium sulfate concentration 
(1.2-1.65 M) by following the change in absorbance of freshly prepared supersat- 
urated solutions. We monitored the increase of absorbance in the reacting solu- 
tions by in situ time-resolved ultraviolet-visible spectroscopy (Agilent Cary 300E 
spectrophotometer). Measurements were carried out at a wavelength of 500 nm 
(absorbance of individual glucose isomerase molecules is minimal at this wave- 
length) and performed in poly (methyl methacrylate) (PMMA) cuvettes located 
inside a peltier thermostated cell module at 20°C. The time elapsed between the 
mixing of protein and salt solutions and the first observed change in turbidity was 
taken as the induction time. This point was determined through linear fitting of 
the sigmoidal absorbance curve near the inflection point and determining the 
intersection with the x-axis. 

To obtain a general idea about the nucleation kinetics and to estimate the 

nucleation induction time, we monitored the turbidity of the crystallizing solu- 
tion between 1.2 M and 1.65 M ammonium sulfate, and obtained an exponential 
dependence of ting on ammonium sulfate concentration (Extended Data Fig. 2). 
We later used these simple estimates of ting as a guide for the preparation of cryo- 
TEM samples, to determine the desired number of samples and time intervals for 
each condition. We also note that, under high ammonium sulfate concentrations, 
the system undergoes near-spinodal decomposition with respect to the crystalline 
phase. Wide-field light microscopy imaging confirms that, even under these con- 
ditions, glucose isomerase solidifies exclusively into the (P2;2)2) crystalline state. 
We find no evidence that any amorphous solid states are formed; nor is there any 
indication that a liquid-liquid phase boundary is crossed. 
Time-resolved dynamic light scattering. We collected intensity correlation func- 
tions of mixed protein-precipitant solutions at 20°C, using 10 mm cylindrical 
cuvettes at an angle of 90°, and employing an ALV-CGS-3 static and dynamic light 
scattering (DLS) device with a 22 mW helium-neon laser (wavelength 632.8 nm). 
We collected data in a pseudo cross-correlation set-up in order to minimize the 
contribution of dead time effects and of photomultiplier-tube after-pulsing to 
the recorded signal. The intensity autocorrelation function go(7) — 1, with 7 being 
the delay time, is connected to the electric-field correlation function gi(T) — 1 
through the Siegert relation*: 


g(r) =B + Ble (DP) 


where B is the baseline of the correlation function at infinite delay, and /3 is the 
function value at zero delay. For samples containing PEG; 909, we used the following 
double-exponential function (equation (1) is used to fit g)(7) at time points before 
kinetic arrest, and equation (2) is a stretched exponential used after gelation has 
occurred): 


g(t) = Aye T+ Age! (1) 


gin=e" (2) 


where p is a fitting parameter, and "= Dq’is the decay rate defined by the diffusion 
coefficient D of the particles and the magnitude of the scattering vector 
q=4nn/ X sin(6/2) at the scattering angle 0. 

We collected time-lapse DLS acquisitions to follow, in real-time, the crystal- 
lization of filtered glucose isomerase solutions in 50 mM HEPES pH 7.0 plus 
100mM MgCh, using a 1.5M ammonium sulfate solution and 48 mg ml! 
glucose isomerase. The time evolution of the intensity correlation curve is shown 
in Extended Data Fig. 6a. Processing of the raw curves with the ALV-correlator 
software (ALV-7004 v3.0.5.1), using the regularization method, yields hydro- 
dynamic radii of the various glucose isomerase populations that form in solution. 
Thirty seconds after glucose isomerase/ammonium sulfate mixing, light scattered 
by the glucose isomerase monomers only was collected (measured hydrodynamic 
radius R;, = 8nm). Over the course of minutes, a second shoulder started to form in 
the correlogram, with the earliest measurable species (at 4 min) corresponding to 
micrometre-sized particles (denoted as ‘clusters’). These species rapidly grew until 
they completely dominated the recorded signal (by 14 min). Visual inspection at 
this point showed that the sample had become opaque. Ex situ wide-field micros- 
copy analysis after 30 min confirmed the presence of P2)2;2 nanocrystals with a 
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small minority of [222 crystals. On the basis of the typical nanorod dimensions 
determined by cryoTEM (length 100 nm; aspect ratio 6), we predict—following the 
corrections described in ref. 31—that they would have an apparent hydrodynamic 
radius of + 45 nm. Given the results discussed above, we conclude that we could 
not detect any light scattered by particles in this size range. 

We also tested conditions that do not yield an ordered solid, but instead lead to 
a kinetically trapped gel state. Time-resolved DLS of a solution of 50 mM HEPES 
pH 7.0, 100 mM MgCl, 7% (w/v) PEGyo00 and 25 mg ml! glucose isomerase 
showed that the intensity auto-correlation function (ACF) could be fit at early 
time points with a double-exponential decay (with a fast-diffusing population 
corresponding to monomers, and a slowly diffusing population corresponding to 
clusters that grow as a function of time). At later stages a stretched exponential was 
required to reproduce the ACF (Extended Data Fig. 6b, c). Stretched exponentials 
indicate a hierarchy of fluctuations on all length scales and are a well known char- 
acteristic of gels. Using optical microscopy, we obtained a visual confirmation of 
the gelled state. The inset of Extended Data Fig. 6c clearly resolves the pores that 
are present in the mesh of fibres in the kinetically arrested state. 

Seeding experiments using glucose isomerase hydrogels. Crystallization trials 
using PEG as a precipitant showed that glucose isomerase exhibits [222/P2,2,2 
polymorphism over a narrow concentration range (Extended Data Figs 1, 5). 
For concentrations lower than or equal to 4% (w/v) PEG;500, we observed only 
1222 crystals. Conversely, for concentrations higher than or equal to 6% (w/v) 
PEG}500, we obtained opaque glucose isomerase hydrogels. At 4.5% (w/v) PEGi500; 
1222/P2,22 polymorphism occurred, with strongly varying nucleation densities 
for the P2)2)2 space group; in some cases a gelatinous phase also formed that 
seemed to enter into competition with the crystalline phases. We observed a sim- 
ilar transition regime for PEG; 090, but shifted towards higher PEG concentrations 
(Extended Data Fig. 1). We transferred a small gel fragment grown at 7% (w/v) 
PEG)500 (Fig. 5a) to a freshly prepared solution that was identical in composition 
but of a lower PEG) 599 concentration ((w/v) 4%; Fig. 5b). Time-lapse imaging of the 
gel-solution interface revealed the rapid and exclusive formation of P2)2)2 crystals 
on, or protruding from, the gel phase (Fig. 5c-e). Transferring a gel fragment to 3% 
(w/v) PEGjs00, however, led to the gradual dissolution of the gel phase as P2,2;2 
crystals emerged over time (Fig. 5f, g). 

Crystallization of GI_His and mutant proteins. To gain more control over 
the polymorph selection process, we designed and produced glucose isomerase 
mutants that we predicted to affect space-group-specific intermolecular contacts 
while leaving all other contacts unchanged. We had three different types of mutant, 
impairing C; contacts (the $171W mutant, with steric inhibition), C2 contacts (the 
R387A mutant, with a salt bridge removed, and GI_His, with steric inhibition) 
or C3 contacts (the R331A/R340D mutant, with charge inversion). We predicted 
that mutants with defective C; and/or C interactions would form exclusively [222 
crystals, whereas impaired C3 constructs would form just P2,2;2 crystals. 

We gauged our ability to control polymorph selection through site-directed 
mutagenesis by setting up crystallization trials for the new constructs, using con- 
ditions that lead (almost) exclusively to either 1222 or P2,2)2 crystals with wild- 
type glucose isomerase. Thus, 50 mM HEPES pH 7.0, 100 mM MgCl, and 1.55 M 
ammonium sulfate leads predominantly to P2;2)2 crystals, whereas 50 mM HEPES 
pH 7.0, 100 mM MgCl and 4% (w/v) PEG;o90 favours the nucleation of the 1222 
space group. If no crystallization (of either space group) could be induced with 
the selected mutant under these conditions, we set up grid screens by varying the 
precipitant concentration. If only one space group could be obtained after such 
a screening, we classified the tested mutant as [222-negative or P2|2,2-negative 
(Table 1 and Extended Data Fig. 8a). We note that any crystallization screening is 
inherently finite, and therefore cannot be used to conclusively rule out the absence 
of a particular polymorph throughout all of chemical space. Hence, as an auxiliary 
method, we set up seeded crystallization tests using pre-grown wild-type [222 or 
P2,2;2 glucose isomerase crystals, which we then washed in their corresponding 
mother liquors to remove any soluble glucose isomerase species, and transferred to 
an identical mother liquor solution supplemented with 10 mg ml! of the respective 
mutant. We monitored the growth of these seed crystals over time using wide-field 
microscopy (Table 1). 

Cryo-TEM. For cryo-TEM, we used 200-mesh copper grids with Quantifoil R 
2/2 holey carbon films (Quantifoil Micro Tools GmbH). We prepared samples 
using an automated vitrification robot (FEI Vitrobot Mark III) for plunging in 
liquid ethane™’. Before use, all TEM grids were surface plasma treated for 40 sec- 
onds using a Cressington 208 carbon coater. We studied the samples with the 
Technische Universiteit Eindhoven/FEI cryoTITAN (www.cryotem.nl) operated 
at 300kV, equipped with a field emission gun (FEG), a post-column Gatan energy 
filter (GIF) and a post-GIF 2k x 2k Gatan charge-coupled-device camera. We 
choose fo as the moment at which we induced supersaturation with respect to the 
crystalline phase (that is, when we mixed the protein with the precipitant solution) 


and feng as the time at which crystals became detectable using light microscopy. 
The exact time point of the samples as indicated in the main text was defined 
as the moment (after blotting excess liquid) when the electron-microscopy grid 
was plunged into the liquid ethane. The selected solution conditions represent a 
compromise between the nucleation density and the overall rate of transforma- 
tion—that is, for TEM one needs on the one hand a high enough particle density, 
and on the other slow enough kinetics to manage the cryogenic-quenching at 
constant time intervals (roughly 2 minutes). We acquired images in low-dose 
mode at a magnification of either x 24,000 with a nominal defocus of —5 1m, or 
11,500 and —10 um defocus. 

Single-particle data processing and projection approximation. We determined 
the defocus of the micrographs by using a script developed in-house (written by 
R. Efremov). We manually picked and stacked 1,240 particles in E2BOXER. A low- 
pass Gaussian filter was applied to remove excessive high-frequency noise, and the 
contrast was inverted before classification. We carried out two-dimensional class 
averaging by K-means classification using a soft circular mask, and then performed 
a multi-reference alignment using SPARX™. 

To approximate a low-dose cryoTEM projection of the rod assemblies (Extended 

Data Fig. 8), we used a Protein DataBank (PDB) model. From the PDB model (con- 
taining all atom coordinates), we created a three-dimensional density map of the 
rod via Chimera 1.12 (using the molmap command). Each atom is described as a 
three-dimensional Gaussian distribution of width proportional to the resolution 
(3nm at —5 1m defocus) and amplitude proportional to the atomic number. The 
pixel size was set to 0.4nm, which is close to the pixel size of acquisition (0.38 nm). 
We summed this three-dimensional intensity map in Matlab along the y-direction 
perpendicular to the rod length, creating a density projection of the structure. 
The TEM image was approximated by subtracting the density projection from a 
flat background image containing Poisson noise (mean intensity = 100 electrons 
per pixel, as the beam intensity during cryoTEM imaging). Fresnel fringes (white 
lines surrounding glucose isomerase) arising from the applied underfocus during 
imaging were not included in the simulation. 
Distances along nanorods, fibres, crystals and gel fibres. To obtain estimates of 
the intermolecular distances within glucose isomerase’s nanorod structures, we 
plotted the power spectrum of the greyscale values along the long axis of a single 
nanorod, and identified two dominant frequencies that correspond to the char- 
acteristic intermolecule and intramolecule distances (Extended Data Fig. 3a—d). 
Ina second approach, we calculated 2D-FFTs using Image] 1.50i (ref. 35) from an 
entire TEM image containing numerous nanorods lying in random orientations 
(Extended Data Fig. 3a). This orientational averaging yielded an FFT image con- 
taining two concentric circles, whose radii again corresponded to the intermolecule 
and intramolecule distances (Extended Data Fig. 3e). We applied orientational 
averaging to the 2D-FFT (Extended Data Fig. 3f) and took the inverse frequencies 
of the two maxima. Applying this second approach to 51 images, we obtained a 
value of 8.2 + 0.1 nm for the intermolecular distance along the long nanorod axis 
(Extended Data Fig. 3g). We used a similar method to determine the characteristic 
distance within the fibre structures and for the nanocrystals, but used selections 
corresponding to specific regions of interest (fibre or crystal outline). Also, in the 
orientational averaging step of the 2D-FFT, we calculated the radial profile over a 
range of 20° and 5° for fibres and crystals, respectively, instead of 180° for the nano- 
rods. Using 27 fibres, we obtained a characteristic distance of 8.0 + 0.1 nm along 
the long axis, and using 29 crystals, we measureed 8.1 + 0.1 nm in the c-direction. 
For the gel fibres, we integrated over 20° and obtained a spacing of 8.0+0.2nm 
(4% (w/v) PEGjs09) and 7.0 + 0.2 nm (7% (w/v) PEG;s99), on the basis of 24 and 
16 measurements, respectively. 
Crystallographic analysis. As starting models for the atomic structures of 
glucose isomerase in the P2;2;2 and [222 space groups, we used the biological 
assembly models of entries LOAD and 9XIA in the RCSB PDB (https://www.rcsb. 
org). We generated nearest crystallographic neighbours of the glucose isomerase 
molecule for both space groups using Chimera 1.11.2rc (Fig. 4). We identified 
residues that partake in lattice contacts by calculating the accessible surface area 
(ASA) on a per-residue level, using AREAIMOL of the CCP4 software suite*®, 
We determined ASAs for both of the starting models, and for the models con- 
sisting of glucose isomerase and its nearest neighbour, by using a probe radius 
of 1.4A. Residues with a non-zero AASA are (partially) buried in the bound 
complex and therefore considered to be part of the lattice-contact patch. We 
identified hydrogen-bond pairs with the FindHBond tool in Chimera 1.11.2rc 
using default settings, and salt bridges using the PDBePISA (http://www.ebi. 
ac.uk/pdbe/pisa/) and the 2P2] (http://2p2idb.cnrs-mrs.fr/2p2i_inspector.html) 
protein-interaction webservers. 

Given these two models (LOAD and 9XIA) of glucose isomerase for both space 
groups, we used crystallographic symmetry operations to generate a number 
of plausible candidates for the experimentally observed nanorods. For this, we 
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identified the various classes of lattice contact (see below) that exist in both space 
groups, and applied translational and rotational symmetry operations using the 
Supercell plugin (https://pymolwiki.org/index.php/Supercell) to Pymol to con- 
struct linear glucose isomerase sequences in space. A basic requirement that 
we set is that adjoining glucose isomerase molecules must be in direct contact 
with each other, be it through van der Waals, hydrogen-bond or electrostatic 
interactions. We discarded loose packing structures with water-mediated contacts 
only (Extended Data Fig. 8). Next, we compared the intermolecule distances and 
particle silhouettes to identify a potential match with the nanorods that were 
imaged using cryoTEM. Nanorods constructed along the (100), (010), (001) and 
(011) directions of the 1222 spacegroup, and along the (100), (010) and (110) 
directions for the P2)2)2 spacegroup, all have a helical ultrastructure that has 
a pitch defined by the respective unit-cell dimensions. The only linear array of 
glucose isomerase molecules that could be generated is that in the (001) direction 
for the P22,2 space group, and the (111) direction for 1222. Careful comparison 
of the orientations of the molecules with respect to the nanorod axis led us to 
conclude that the P2,22 (001) model is the most plausible. Indeed, juxtaposing 
the cryoTEM image of a single nanorod with the van der Waals surface representa- 
tion of our P2;2;2 (001) nanorod model showed good agreement between the 
two. Also, the P2;2;2 (001) nanorod had an intermolecular distance of 7.83 nm 
when using PDB 1OAD as a reference structure. The nanocrystals that we grew, 
however, were less compact. The intermolecular distance (along the c-axis; based 
on the fringe pattern in the cryoTEM images) is 8.1 + 0.1 nm—a good match to 
the experimental intermolecular distance within the nanorods (8.1 + 0.2 nm). We 
therefore conclude that the nanorods that are formed in high ammonium sulfate 
concentrations are linear molecular arrays that can also be found along the c-axis 
ofa mature P2;2)2 glucose isomerase crystal. 

Lattice contacts. For the P222 space group, we identified two types of lattice 
contact that involve three different surface patches, designated P1, P2a and P2b 
(Extended Data Table 2). Contact 1 (C;) is made in the (001) direction by the 
self-recognition of patch P1, and is duplicated owing to the non-crystallographic 


LETTER 


twofold symmetry of the glucose isomerase tetramer. The total contact therefore 
includes the formation of 2 x 3 hydrogen bonds and has a AASA of 844 A?. Contact 
2 (C2) along the (110) direction is formed by the binding of P2a with P2b, involving 
the formation of seven hydrogen bonds and two salt bridges, and encompassing 
a total AASA of 622 A?. For the [222 space group, there is just one lattice-contact 
type (contact 3, C3), which involves two surface patches, Ia and Ib, making three 
hydrogen bonds and resulting in a net AASA of 372 A. Note that these patches 
are unique to their respective space groups, although P2a and Ia share one amino 
acid (D81). 

Data availability. We declare that the data supporting the findings of this study 
are available within the paper and the Extended Data figures and tables. Further 
data are available from the corresponding authors upon request. 


30. Berne, B. J. & Pecora, R. Dynamic Light Scattering: With Applications to 
Chemistry, Biology, and Physics (Courier Dover Publications, 1976). 

31. Ortega, A. & Garcia de la Torre, J. Hydrodynamic properties of rodlike 
and disklike particles in dilute solution. J. Chem. Phys. 119, 9914-9919 
(2003). 

32. Krall, A. H. & Weitz, D. A. Internal dynamics and elasticity of fractal colloidal 
gels. Phys. Rev. Lett. 80, 778-781 (1998). 

33. Friedrich, H., Frederik, P. M., de With, G. & Sommerdijk, N. A. J. M. Imaging of 
self-assembled structures: interpretation of TEM and cryo-TEM images. Angew. 
Chem. Int. Edn 49, 7850-7858 (2010). 

34. Hohn, M. et al. SPARX, a new environment for cryo-EM image processing. 

J. Struct. Biol. 157, 47-55 (2007). 

35. Schneider, C. A., Rasband, W. S. & Eliceiri, K. W. NIH Image to ImageJ: 25 years 
of image analysis. Nat. Methods 9, 671-675 (2012). 

36. Potterton, E., Briggs, P., Turkenburg, M. & Dodson, E. A graphical user interface 
to the CCP4 program suite. Acta Crystallogr. D 59, 1131-1137 (2003). 

37. Vuolanto, A., Uotila, S., Leisola, M. & Visuri, K. Solubility and crystallization of 
xylose isomerase from Streptomyces rubiginosus. J. Cryst. Growth 257, 403-411 
(2003). 

38. Sleutel, M., Willaert, R., Wyns, L. & Maes, D. Kinetics and thermodynamics of 
glucose isomerase crystallization. Cryst. Growth Des. 9, 497-504 (2009). 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


0.8MAS — 
30 mg/ml GI 
50 mM Hepes, 100 mM MgCl2 


50 um 


PDB ID 9XIA 
a=9.47nm b=9.92nm c=10.30nm 


C 


Glucose Isomerase / mg.mI* 


ok 


1222 
>< P21212 
© GEL 


se=: C 


e,avg 


Glucose Isomerase / mg.mI? 


0 2 4 


PEG150 


30 mg/ml Gl stam 


50 mM Hepes, 100 mM MgCl2 


P2 1212 


PDB ID 10AD 
a=9.85nm b=12.96nm c=7.83nm 


1mm 


6 8 


PEG1000 / %w/v 


Extended Data Figure 1 | Phase diagrams for glucose isomerase. 

a, b, Wide-field microscopic images of the 1222 (a) and P2;2)2 (b) glucose 
isomerase (GI) polymorphs obtained with 0.8 M and 1.5M ammonium 
sulfate (AS). c, Glucose isomerase phase diagram in ammonium sulfate 
((NH4)2SOx) at 22 °C (single points represent triplicate measurements), 
showing the solubility line C...,g (dashed line). Smaller diamonds and 
crosses denote smaller numbers of crystals than larger symbols. Ce,avg is a 


mathematical average that we calculated by using the solubilities at 19°C 
and 25°C from ref. 9. d, Glucose isomerase phase diagram in PEG; 099 at 
22°C, with the C.ayg solubility line taken from ref. 38. The dotted lines, 
following the same colour code as the single points, indicate the phase 
boundaries in PEGj599. The photographs to the right are representative 
microscopy images at the indicated precipitant concentrations. 
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Extended Data Figure 2 | Induction time measurements. Induction time, 
tind aS a function of ammonium sulfate concentration. Values next to data 
points correspond to calculated supersaturation (InC/C,) values, according 
to ref. 37. 
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distance along the nanorod axis. a, Complete image acquired at x 24,000 h, Intermolecular distance along the nanorod axis compared with the 
magnification. b, CryoTEM image of single nanorods. c, Greyscale crystallographic distance along the c-axis of the prismatic crystals. The box 
values along the length of the dotted line in panel a. d, 1D-FFT of panel range shows the 25th to 75th percentiles; the horizontal line is the median; 
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Extended Data Figure 4 | Nanorod formation at early time points. CryoTEM images of crystallizing glucose isomerase solutions 20 seconds after 
protein—-precipitant mixing with 1.35 M, 1.50 M or 1.55 M ammonium sulfate. 
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Extended Data Figure 5 | 1222/P2,2,2/gel coexistence point. a—c, Crystallization experiments using the microbatch-under-oil set-up, with 86 mg ml"! 
glucose isomerase, 50 mM HEPES pH 7.0 and 100 mM MgCh, and 4% (a), 4.5% (b) or 5% (c) (w/v) PEGisop. a, 1222 crystals; b, [222 + P2,2)2, P2,22, 
P2,2)2 and gel; c, gel. 
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Extended Data Figure 6 | Time-resolved DLS of crystallizing glucose 
isomerase solutions. a, DLS time series of a crystallizing 48 mg ml“! 
glucose isomerase solution with 50 mM HEPES pH 7.0, 100 mM MgCh, 
1.5M ammonium sulfate, collected at an angle of 90°, ranging from 

30 seconds to 14 minutes after protein/precipitant mixing. R, particle 
radius. Microscopy snapshots at the right were taken ex situ after 


30 minutes. b, Time evolution (from dark to light) of the intensity 
correlation function of a 50mM HEPES pH 7.0, 100mM MgCh, 6% 
PEG jo00 (w/v) solution collected at an angle of 90°. c, Fitting of a pre- 
gelled (20 seconds; left-hand y-axis) and gelled (30 minutes; right-hand 
y-axis) sample using equations (1) and (2) respectively. Inset, wide-field 
microscopy image of the gelled state. 
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Extended Data Figure 7 | Crystallographic modelling of the nanorods. 
Models of glucose isomerase nanorods in various directions, based on 
the unit-cell dimensions of the PDB entries 9XIA and 1OAD, and the 
crystallographic symmetry elements of space groups 1222 and P2,2;2. 
The numbers designating intermolecular distances are in nanometres. 


The number in brackets for P2,2,2 (001) is the value that we obtained 
experimentally. For reference, we compare a magnified cryoTEM image 
of a single nanorod and a simulated TEM projection based on the P2;2;2 
(001) nanorod model. 
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Extended Data Figure 8 | Crystallization screening of glucose isomerase _ interactions), GI_His (perturbed C,), R387A (perturbed C2) and R331A/ 


mutants with perturbed lattice contacts. a, Initial crystallization R340D (perturbed C3). b, Cryo-TEM images of various mutants in 50 mM 
screening of mutants in 50mM HEPES pH 7.0, 100 mM MgCh, HEPES pH 7.0, 100mM MgCh, 15mg ml! mutant protein and 1.5M 
15mg ml“! of glucose isomerase mutant and 4% (w/v) PEGyo99 or ammonium sulfate, 2 minutes after protein/precipitant mixing. 


1.5M ammonium sulfate. The mutants are $171 W (with perturbed C; 
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Extended Data Table 1 | Comparison of experimental and theoretical distances for both space groups 


1222 Face do235 d200 P2, 21 2 Face doo1 d5209 
Theoretical (011) 6,975 4,961 Theoretical (110) 7.833 8.137 
(101) 7,151 4,703 (110) 7.833 8.137 
(110) 6,836 5,151 
Experimental AS 6.59 £0.15 4.72 +£0.10 Experimental AS 8.07 £0.12 8.36 £0.10 
PEG 5.02 7.08 


Distances (d) are in nanometres. AS, ammonium sulfate. 
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Extended Data Table 2 | Lattice-contact analysis of both space groups 


Space Group Contact H-bonds  Salt-Bridges Patch AASA Charges Residues 
P2122 Cl 6 0 Pil 422 (1+,4-) E167, T170, S171, Q172, G173, 
E207, R208, E210, A344, D345, 
G346, L347, Q348, A349 
P2a 320 (3+,3-)  V37, E38, R41, E70, K73, R74, 
P21212 C2 7 2 Q77, D81 
P2b 320 (2+,2-) A319, A322, D323, P324, E325, 
R374, G385, A386, R387, G388 
1222 C3 3 0 Ila 177 (2+,2-) R76, D80, D81, T82, G83, K85 
I1lb 195 (2+,1-) R331, D336, R340, P341 


The residues listed in the last column have non-zero AASA (A) values and are therefore considered to be (partially) buried. Residues that are involved in a hydrogen (H) bond or salt bridge with a 
nearest-neighbour residue are shown in bold and red, respectively. The underlined residue D81 is common to patches P2a and la. 
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Climatic control of Mississippi River flood hazard 
amplified by river engineering 


Samuel E. Munoz!?, Liviu Giosan!, Matthew D. Therrell*, Jonathan W. F. Remo®, Zhixiong Shen®’, Richard M. Sullivan)’, 


Charlotte Wiman!, Michelle O’Donnell! & Jeffrey P. Donnelly! 


Over the past century, many of the world’s major rivers have been 
modified for the purposes of flood mitigation, power generation 
and commercial navigation!. Engineering modifications to the 
Mississippi River system have altered the river’s sediment levels 
and channel morphology’, but the influence of these modifications 
on flood hazard is debated*->. Detecting and attributing changes 
in river discharge is challenging because instrumental streamflow 
records are often too short to evaluate the range of natural 
hydrological variability before the establishment of flood mitigation 
infrastructure. Here we show that multi-decadal trends of flood 
hazard on the lower Mississippi River are strongly modulated by 
dynamical modes of climate variability, particularly the El Niio- 
Southern Oscillation and the Atlantic Multidecadal Oscillation, 
but that the artificial channelization (confinement to a straightened 
channel) has greatly amplified flood magnitudes over the past 
century. Our results, based on a multi-proxy reconstruction of flood 
frequency and magnitude spanning the past 500 years, reveal that 
the magnitude of the 100-year flood (a flood with a 1 per cent chance 
of being exceeded in any year) has increased by 20 per cent over 
those five centuries, with about 75 per cent of this increase attributed 
to river engineering. We conclude that the interaction of human 
alterations to the Mississippi River system with dynamical modes 
of climate variability has elevated the current flood hazard to levels 
that are unprecedented within the past five centuries. 

Flooding of the lower Mississippi River in the spring of 2011 was 
among the largest discharge events since systematic measurements 
began in the late nineteenth century, and it caused US$3.2 billion in 
agricultural losses and damages to infrastructure®. This and other 
recent flood events on the Mississippi River—including those in 2016 
and 2017—have repeatedly, although controversially, been attributed 
to an aggressive campaign of river engineering designed and imple- 
mented over the past 150 years*-*. Federally mandated efforts to reduce 
the impacts of flooding began in the late nineteenth century and 
initially relied almost exclusively on the use of artificial levees, but this 
strategy was revised in the wake of a particularly devastating flood in 
the spring of 1927 that overwhelmed the levee system’. The current 
flood management system—the Mississippi River & Tributaries Project 
(MR&T)—includes a series of spillways that can be opened to relieve 
pressure on an enlarged levee system, as well as an artificially short- 
ened and straightened main channel that is held in place by concrete 
retaining walls (revetments) and isolated from most of its natural flood- 
plain®*’. Although these modifications are credited with protecting 
communities and croplands within the floodplain from inundation, 
artificial channelization has altered the relationship between discharge 
and river stage** and accelerated the rate of land loss in the Mississippi 
River delta’, necessitating additional investments in flood mitigation 
infrastructure and coastal restoration’. 


Although fluvial processes are sensitive to flood mitigation 
infrastructure, climate variability can also shape the dynamics 
of continental drainage networks, particularly over decadal to 
centennial timescales that are difficult to detect using short obser- 
vational records!®!'. Precipitation and soil water storage over the 
Mississippi River basin are influenced by climate variability driven by 
sea-surface-temperature anomalies in both the Pacific and Atlantic 
Oceans'*!%, Yet establishing the natural controls on discharge extremes 
of the lower Mississippi has proved challenging because gauging- 
station measurements record a limited range of variability, particularly 
before major investments were made in river engineering. As a 
result, analyses of historical streamflow records disagree over the role 
that dynamical modes of climate variability play in modulating the 
discharge!*!*!°. To plan flood mitigation and other infrastructure 
projects, it is critical to understand the climate controls on the 
discharge of the lower Mississippi River, but the short length of the 
instrumental record limits our ability to evaluate the range of natural 
hydrological variability from observational data alone. 

Recent advances in palaeoflood hydrology could extend the instru- 
mental record back in time to diagnose the controls on the discharge 
of large alluvial rivers such as the lower Mississippi. Traditional 
approaches in palaeoflood hydrology, which include the use of slack- 
water deposits as flood event indices!®, are of limited use on the low- 
relief landscapes that characterize the Mississippi River alluvial plain. 
One new approach uses the sedimentary archives held in floodplain 
lakes, which act as sediment traps during overbank floods, to develop 
continuous, quantitative and event-scale records of past flood frequency 
and magnitude'”!®, Parallel work in dendrochronology demonstrates 
that when trees are inundated by floodwaters they exhibit anatomical 
anomalies in that year’s growth ring such that they provide a precise 
chronology of flood events that occurred during the growing season”. 
Together, these methodological advances provide an opportunity to 
evaluate interannual to multi-decadal scale trends in flood frequency 
and magnitude on a large alluvial river such as the lower Mississippi, 
before and during the era of river engineering. 

Here we analyse records of individual overbank flood events derived 
from sedimentary and tree-ring archives from the lower Mississippi 
River's floodplain (Fig. 1). We collected sediment cores from the in- 
filling thalwegs of three oxbow lakes, Lake Mary (MRY), False River 
Lake (FLR) and Lake Saint John (STJ), that formed by neck cut-offs of 
the lower Mississippi River in AD 1776, AD 1722 and roughly ap 1500, 
respectively” (Extended Data Figs 1-3). In these sedimentary archives, 
we identified individual flood events by using grain-size analysis, 
bulk geochemistry (from X-ray fluorescence scanning, XRF) and 
radiography; developed age-depth models constrained by multiple 
independent chronological controls (Extended Data Figs 4-6); and 
estimated flood magnitudes from a linear model that relates the coarse 
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Figure 1 | The lower Mississippi River and the Mississippi River basin 
in North America. River engineering modifications (artificial cut-offs 
and levees) that contribute to channelization, the locations of palaeoflood 
records (FLR; MRY; STJ; and Big Oak Tree, BOT) and river gauging 
stations on the lower Mississippi used in this study (Memphis, Helena, 
Arkansas City, Vicksburg and Baton Rouge) are shown. Shaded relief 
shows relative topographic highs (dark shades) and lows (light shades) 
using the National Elevation Dataset”. 


grain-size component to the discharge of historical flood events'® 
(Extended Data Fig. 7; see Methods for details). We also include tree- 
ring records from the floodplain of the lower Mississippi, collected and 
described by ref. 21; each tree-ring series was examined for anatomical 
evidence of flood injury to produce a record of overbank flood events 
that extends back to the late seventeenth century”!. A composite time 
series for flood frequency describing the number of flood events in 
a moving 31-year window derived from sedimentary and tree-ring 
archives (Fig. 2b) is highly correlated with instrumental flood fre- 
quency (r=0.90, t= 19.12, effective degrees of freedom Ve¢¢= 3.77, 
p <0.001) for the interval of overlap, while reconstructed flood magni- 
tudes (Fig. 2c) track trends observed in gauging-station measurements 
(see Supplementary Information for additional validation), indicating 
that the palaeoflood archives provide robust reconstructions of hydro- 
logical extremes on the lower Mississippi River beyond the period of 
instrumental record. 

Our multi-proxy palaeoflood dataset extends the record of extremes 
in the discharge of the lower Mississippi River back to the early 
sixteenth century and demonstrates that both the frequency and 
magnitude of flooding have increased over the past 150 years as land use 
and river engineering efforts have intensified (Fig. 2). Flood frequencies 
and magnitudes exhibit multi-decadal oscillations that increase in 
amplitude around the beginning of the twentieth century such that 
the highest rates of overbank flooding and the largest discharge events 
of the past 500 years have occurred within the past century. The 
amplification of flood magnitudes that has occurred over the past 150 
years corresponds in time with the intensification of anthropogenic 
modifications to the lower Mississippi River and its basin, particularly 
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Figure 2 | Instrumental and reconstructed flood frequencies and 
magnitudes of the lower Mississippi River. a, Human impacts to the 
lower Mississippi River (MR&T refers to a major river engineering 
initiative): timing and intensity of agricultural land use” and river 
engineering. b, Flood frequencies (number of flood events in a 31-year 
moving window) derived from palaeoflood records, including mean and 
bootstrapped 2c confidence intervals of all palaeoflood archives, and the 
instrumental frequency of all floods attaining major flood stage (>1.5m 
above flood stage) at the Mississippi River gauging station at Baton 
Rouge (station number 07374000). c, Flood magnitudes derived from the 
sedimentary palaeoflood records, with 1o uncertainties, and instrumental 
flood magnitudes for the Mississippi River gauging station at Vicksburg 
(station number 07289000). 


the artificial channelization of the river with levees, revetments and 
cut-offs in the late nineteenth and early twentieth centuries®’. Yet the 
continued presence of multi-decadal oscillations in flood frequency 
and magnitude throughout the entire period of record indicates that 
anthropogenic modifications to the Mississippi River system are acting 
in concert with other factors to alter flood hazard through time. 

To evaluate the role of climate variability on flood hazard, we examined 
the relationships between flood frequency, the El Nifio-Southern 
Oscillation (ENSO) and the Atlantic Multidecadal Oscillation (AMO), 
to find that sea-surface temperature anomalies in both the Pacific and 
Atlantic Oceans exert a strong influence on the occurrence of lower 
Mississippi River floods (Fig. 3). Over the past five centuries, corre- 
lations between composite flood frequency and the frequency of El 
Nifio events (r =0.73) and the AMO index (r= —0.39) derived from 
instrumental and palaeoclimate data sets are significant (p < 0.001; 
see Methods for details). The strength and direction of these relation- 
ships support the hypothesis that discharge extremes on the lower 
Mississippi River arise through the interaction of ENSO, which influ- 
ences antecedent soil moisture, with the AMO, which controls the flux 
of moisture from the Gulf of Mexico inland’ !°. Extreme precipitation 
events over the Mississippi River basin are associated with a stronger 
and more westerly position of the North Atlantic Subtropical High that 
is characteristic of the negative phase of the AMO!', and these heavy 
precipitation events are more likely to generate discharge extremes 
if they fall on the saturated soils that tend to be left in the wake of El 
Nifio events’. 

Despite the strong influence of climatic variability on lower 
Mississippi River flood occurrence, the amplification of flood magni- 
tudes that we observe over the past 150 years is primarily the result of 
human modifications to the river and its basin (Fig. 4). The magnitude 
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Figure 3 | Lower Mississippi River flood frequency and its relation 

to dominant modes of climate variability. a, AMO derived from 
instrumental’’ and palaeoclimate”* datasets. b, Frequency of El Nifio 
events (the warm phase of the ENSO) in a 31-year moving window 
derived from instrumental’’ and palaeoclimate”**! data sets (mean with 
20 bootstrapped confidence interval). SST, sea surface temperature. 

c, Frequency of lower Mississippi River floods derived from palaeoflood 


of the 100-year flood (Qo; a flood with a 1% chance of exceedance in 
any year) estimated from gauging-station measurements (AD 1897- 
2015) is (20 + 7)% larger than Qj 9 for the period before major human 
impacts to the river and its basin (AD 1500-1800), as estimated from 
the palaeoflood data (see Methods for details). To identify the influence 
of human activities on this observed increase in Qi 99, we use a linear 
model that relates peak discharge to the AMO index over the period 
before major human impacts to the river, AD 1500-1800 (R*=0.35, 
degrees of freedom v= 18, p < 0.01) and use this model to predict 
flood magnitudes over the entire period of record. This ‘climate-only’ 
regression predicts that, in the absence of human modifications to the 
land surface, Qio9 would have increased by only (5 +6)% over the same 
period, accounting for only about 25% of the observed increase in Qj 9 
and implying that the remainder (about 75%) of this elevated flood 
hazard is the result of human modifications to the river and its basin. 


data (mean with bootstrapped 20 confidence interval). d, Correlation field 
of monthly precipitation*’ with the AMO” (Ap 1901-2014) smoothed 
with a common 121-month filter. e, Correlation field of monthly Palmer 
Drought Severity Index** with the Nifio 3.4 index’’ (ap 1948-2011). 
Correlation fields are interpolated to a common 2° x 2° grid, and 
individual points with significant correlations at the P < 0.05 level are 
marked with a hollow circle. 


The timing and nature of the amplification of flood magnitudes at 
the onset of the twentieth century strongly imply that it reflects the 
transformation of a freely meandering alluvial river to an artificially 
confined channel, because the confinement of flood flows to a levee- 
defined floodway can speed up the downstream propagation of a 
flood wave and increase peak discharge for a given flood’. The esta- 
blishment of widespread agricultural activity in the Mississippi River 
basin occurred in the nineteenth century, before the divergence of the 
observed and ‘climate-only’ flood magnitudes, indicating a secondary 
and possibly lagged influence of agricultural expansion”? on flood 
magnitudes relative to that of river engineering. In short, this analysis 
identifies artificial channelization of the lower Mississippi River, and 
its effects on the river’s gradient, channel area and flow velocity”, 
as having significantly increased the discharge of a given flood event 
relative to pre-engineering conditions. 
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Figure 4 | Attribution of the observed increase in flood magnitudes 
over the past five centuries. a, Composite peak discharges from 
palaeoflood archives and the instrumental record from Vicksburg. The 
red line indicates observed trends in the largest flood of the century in 
a moving window; the blue line indicates trends under ‘climate-only’ 
conditions, estimated from a statistical model (see text for details). 
Both lines are shown with 1o confidence intervals. Instrumental peak 
discharge estimates are reported without uncertainty and are therefore 


plotted without confidence intervals. b, Comparison of the 100-year flood 
observed during the baseline period (ap 1500-1800, before major human 
modifications to the Mississippi River and its basin; grey boxplot) with 
that estimated using a statistical model under ‘climate-only’ conditions 
(blue boxplot) and observed (red boxplot) during the modern period of 
instrumental record (AD 1897-2015). Boxplots show mean (centre line) 
and 1o confidence intervals (box top and bottom) for Qjo9 estimates. 
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Our main finding—that river engineering has elevated flood hazard 
on the lower Mississippi to levels that are unprecedented within the past 
five centuries—adds to a growing list of externalized costs associated 
with conventional flood mitigation and navigation projects, including 
a reduction in a river’s ability to convey flood flows**, the acceleration 
of coastal land loss® and hypoxia”*. Despite the societal benefits that 
these major infrastructure projects convey’, the costs associated with 
maintaining current levels of flood protection and navigability will con- 
tinue to grow at the expense of communities and industries situated in 
the river’s floodplain and its delta. For those interested in improving 
seasonal and longer-term forecasts of flood hazard or management 
strategies that reconnect the river with its floodplain, the Mississippi 
River’s discharge of freshwater—and by extension the flux of sediment, 
nutrients and pollutants—to its outlet should be viewed as highly sensi- 
tive both to anthropogenic modifications to the basin and to variability 
of the global climate system. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Instrumental streamflow data. We obtained daily stage data for Mississippi River 
gauges at Vicksburg (station number 07289000) and Baton Rouge (07374000) 
from the United States Army Corps of Engineers (USACE) and the United States 
Geological Survey (USGS). Discharges for the Vicksburg, Memphis (07032000), 
Helena (07047970), Arkansas City (07146500) and Baton Rouge gauges were com- 
piled from multiple sources. For the early instrumental record (pre-1927), peak 
discharges and measured discharges were compiled from historical documents». 
In the few cases in which annual peak discharges were not recorded during this 
period, we used the measured discharges to create rating curves from which to 
determine the peak discharge for the annual peak stage. Discharge data after ap 
1927 were acquired either from the USACE or from the USGS. The discharge 
record at Vicksburg is the longest and most continuous of the available discharge 
records, and its peak annual discharge is highly correlated (r > 0.86, p < 0.01) 
with that of other lower Mississippi River gauging stations in the study area (see 
Supplementary Information) and was thus used to reconstruct flood magnitudes 
from the sedimentary archives. 

Sedimentary archives. We collected sediment cores from the infilling thalwegs 
of MRY, FLR and ST] with a rod-driven vibracore system in July 2012 and March 
2016 (Extended Data Figs 1-3). For each core, we collected a replicate drive using 
a 7.5-cm-diameter polycarbonate piston corer to ensure recovery of an intact sedi- 
ment/water interface. The targeted lakes were selected because the lateral position 
of the active channel near the lake’s arm has remained relatively stable from the 
time of cut-off to the mid-twentieth century”°. We cannot eliminate the possibility 
that minor lateral and/or vertical channel migration has occurred near these lakes 
since the time of cut-off, but we reduce the influence of this potential bias on our 
analysis by (i) using a low-pass filter on the grain-size data (see below) and (ii) 
validating the resulting flood frequency and magnitude data sets against the instru- 
mental record (see Supplementary Information). At FLR and ST], mainline levees 
of the MR&T have inhibited the deposition of fluvial sediment in the lake during 
overbank floods after about ap 1950 and 1937, respectively; MRY is not protected 
by artificial levees and it continues to be inundated during overbank floods. Oxbow 
lakes can continue to exchange water and sediment with the main channel when 
the river is below flood stage” to create high rates of fine-grained ‘background 
sedimentation that differs in texture and composition from the coarser material 
that is mobilized during high-magnitude flood events. Cores were collected 
along an arm of the oxbow lakes at locations proximal to the ‘plug’ that separates 
the active channel from the lake to maximize the contrast between background 
and flood event sediments. Core locations at each site were targeted based on 
bathymetric surveys before core collection. 

Cores were transported back to the Woods Hole Oceanographic Institution 
(WHOD) where they were split, described and photographed. Archived core halves 
were subjected to high-resolution XRF (4,000 1m resolution) and radiography 
(200|.m resolution) in an ITRAX core scanner housed at WHOL. For grain-size 
analysis, sediment sub-samples at continuous 1-cm intervals were dispersed in 
water using a vortex mixer before 5s sonication and analysis in a Beckman Coulter 
LS 13 320 laser diffraction particle-size analyser; randomly selected replicate 
samples showed a <1% volume difference in any detector. Complex, multi-modal 
grain-size distributions were modelled as mixtures of discrete, simple distributions 
and decomposed using end-member calculations into four representative popula- 
tions, or end-members (EMs), that were considered geologically meaningful, using 
the EMMAgeo package run in RStudio. The score of each sample on the coarsest 
end-members (EM1), representing deposition of bedload during overbank floods'’, 
was normalized with a low-pass (41-cm) moving minimum filter to remove long- 
term trends in sediment composition caused by local geomorphic processes. We 
then identified potential flood deposits as normalized EM1 scores that exceeded a 
high-pass (11-cm) moving mean with a 0.1 EM1 score threshold, and we verified 
identified peaks against the XRF and radiography (Extended Data Figs 4-6). 

To estimate flood magnitudes from the sediment records, we used the method 
of ref. 18 and developed linear models that describe the normalized EM1 scores 
as a function of historical flood event discharge at the Mississippi River gauging 
station at Vicksburg. Using this, we assigned each flood deposit to a historical flood 
event approximating ‘major flood stage’ as defined by the USGS at a nearby gaug- 
ing station, in stratigraphic order, and within the 20 age estimate for the deposit 
(Extended Data Fig. 7). The requirement for flood deposits to be assigned to 
historical floods in stratigraphic order eliminated ambiguity in cases in which 
more than one historical flood fell within a deposit’s 2c age estimate. There were 
no cases for which a flood deposit could not be assigned to a historical flood within 
the period of instrumental observations (Ap 1897-2015), but there were three 
cases at FLR (ap 1944, 1929 and 1920) and two cases at STJ (ap 1920 and 1913) 
for which a major historic flood did not leave an identifiable flood deposit. These 
‘missing’ flood deposits are rare and occurred during periods of high flood 
frequency, and they may reflect reduced sediment availability’ during these events. 
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The sedimentary record reconstructs peak annual discharge at the Vicksburg 
gauge, not at individual site locations. 

We developed age-depth models using Bacon v.2.2**, a Bayesian age—depth 
modelling program, informed by multiple independent dating techniques (see 
Supplementary Information), including: (i) !37Cs and !°Pb activity in desiccated 
and powdered bulk sediment samples in a Canberra GL2020RS well detector for 
low-energy germanium gamma radiation, for which we used the constant rate of 
supply model”? to estimate the age of a sampled depth; (ii) radiocarbon (‘4C) dating 
via accelerator mass spectrometry of a terrestrial plant macrofossil at the National 
Ocean Sciences Accelerator Mass Spectrometers facility at WHOL, calibrated using 
the IntCal13 curve embedded in Bacon; (iii) optically stimulated luminescence 
(OSL) dating with the fast component of silt-sized quartz‘° using a Riso DA-15 B/C 
luminescence reader at the University of Liverpool, UK; (iv) core tops as the date 
of collection and, when appropriate, the age of lake formation” as the core bottom. 
Sedimentation rate priors were increased to near-instantaneous rates through thick 
(>20cm) flood deposits!”. 

Tree-ring records. Tree-ring samples from 33 living and 2 dead oak (Quercus lyrata 
and Q. macrocarpa) trees were collected from Big Oak Tree State Park (BOT) in 
southeast Missouri*!. One to four core samples were extracted from each tree at 
or below breast height (about 1.4m) using a 5-mm-diameter Swedish increment 
borer. Cross-sections from dead trees were collected as close to the base of the tree 
as possible. All samples were absolutely cross-dated using the skeleton-plot method 
of dendrochronology. Tree-ring widths were measured on a stage micrometer to 
a nominal resolution of 0.001 mm. We crosschecked the accuracy of our visual 
dating using the computer program COFECHA. We visually determined flood- 
ring years by examining each tree-ring series for any evidence of flood injury con- 
sistent with the anomalous anatomical features caused by flooding as described by 
previous flood-ring studies!’. Additional characteristics used in our identification 
included ‘jumbled ranks’ or ‘additional ranks’ of early wood vessels or zones of 
‘extended earlywood and disorganized flame parenchyma as well as ‘offset’ early 
wood ranks!’. We used the same criteria as ref. 21 to identify flood events (that is, 
a year in which more than 10% of sampled trees exhibited signs of flood injury) as 
this threshold encompasses all historic floods that attained major flood stage and 
occurred during the growing season”), 

Historical climate and palaeoclimate data. Historical (late nineteenth century to 
present) indices of ENSO and AMO”’ were extended back to the sixteenth century 
with annual palaeoclimate reconstructions of ENSO***! and AMO“!. To compare 
the ENSO series, we identified El Nifio events in the historical Nifio 3.4 index as 
periods of five consecutive overlapping 3-month windows at or above +0.5°C, 
and as years with anomalies of more than +0.5°C in the palaeoclimate series. 
We then derived El Nifio event frequencies using a 31-year moving window on 
each record, and we computed the mean of the historical and all palaeoclimate El 
Nifio frequencies and bootstrapped 20 confidence intervals using the boot func- 
tion in RStudio. For the composite AMO series, we used the detrended historical 
AMO index” back to ap 1871, and then transitioned to a palaeoclimate AMO 
reconstruction"! to ap 1572. We sampled this composite AMO index at the median 
age probability of the 20 palaeofloods that occurred between ap 1500-1800, and 
used these data to develop a linear model (using the /m function in RStudio) that 
relates peak discharge from the AMO index; the El Nifio frequency timeseries 
was not a significant predictor of flood magnitudes, presumably because Pacific 
sea-surface temperatures do not control the inland flux of Gulf of Mexico mois- 
ture that triggers high-magnitude discharge events’®, so only the AMO index was 
used to statistically estimate flood magnitudes under ‘climate-only’ conditions. 
The AMO is detrended to remove recent warming of North Atlantic sea surface 
temperatures, so the ‘climate-only’ estimates of Qjo9 do not consider the potential 
effects of recent greenhouse warming on flood magnitudes—although we note that 
the inverse relationship between AMO and Mississippi River flood magnitudes 
implies that warming of North Atlantic sea-surface temperatures would act to 
suppress flood magnitudes. When evaluating the significance of Pearson corre- 
lations between climate and hydrological time-series that exhibited high degrees 
of serial autocorrelation, we estimated the effective degrees of freedom with the 
following relation’: 


vert = N(1-2,9,)/(. + 2%) (1) 


where N is the number of independent samples, and (p, and , are the lag-1 auto- 
correlation coefficients of time series x and y respectively. 

Flood hazard attribution. The magnitude of Qio9 was estimated both empirically 
and through statistical modelling. The sedimentary palaeoflood archives record 
major flood events over periods greater than 100 years, and are suitable for esti- 
mating recurrence intervals empirically through the relation: 


t,=(n+1)/m (2) 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


where tf, is the recurrence interval (the inverse of t, is the probability that the 
event magnitude will be exceeded in any one year), is the number of years in 
the window being considered, and m is the number of recorded occurrences of 
the event being considered. The same approach was used to estimate Qjo9 in the 
statistically modelled ‘climate-only’ peak annual discharges derived from palaeo- 
climate and historical climate records. The instrumental record at the Vicksburg 
gauge provides a measurement for peak annual discharge in every year, but is 
relatively short, so the modern Qyo9 was estimated statistically by fitting a log 
Pearson type III distribution to the data set following standard protocols out- 
lined by the United States Interagency Advisory Committee of Water Data’ for 
instrumental hydrological data sets. We compared the observed Qjo9 baseline (AD 
1500-1800) with the observed and ‘climate-only’ Qio9 estimates for the modern 
period (Ap 1897-2015) and attributed the proportion of the observed change that 
was not explained by the ‘climate-only’ estimates to human alterations to the river 
channel and basin. The modern Qj 9 estimated empirically from sedimentary 
records and the modern Qyj9 estimated by fitting a generalized extreme value 
distribution to the instrumental data both fall within the 1a confidence intervals 
of the modern Qj estimated by fitting a log Pearson type III to the instrumental 
record (see Supplementary Information), indicating that our findings are robust 
to different estimations of flood hazard. 

Data availability statement. The datasets generated by this study are available as 
Supplementary Data. 

Code availability. The R code used to produce the figures in this paper is available 
from the corresponding author on reasonable request. 
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Extended Data Figure 1 | Location of Lake Mary, Mississippi (MRY) be inundated during overbank floods. Bathymetric contours (white) given 


and sediment core (MRY2) used in this study. Lake Mary is an oxbow in metres. Shaded relief shows relative topographic lows (dark shades) and 
lake that formed via neck cut-off of the lower Mississippi River in AD highs (light shades) according to the National Elevation Dataset”. 


1776” and is situated inside the modern floodway such that it continues to 
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Extended Data Figure 2 | Location of False River Lake, Louisiana, and (white) given in metres. Shaded relief shows relative topographic lows 
sediment core (FLR1) used in this study. False River Lake is an oxbow (dark shades) and highs (light shades) according to the National Elevation 
lake that formed via neck cut-off of the lower Mississippi River in aD Dataset”®. 


1722” and is situated outside the modern floodway. Bathymetric contours 
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Extended Data Figure 3 | Location of Lake Saint John, Louisiana, and (white) given in metres. Shaded relief shows relative topographic lows 
sediment core (STJ1) used in this study. Lake Saint John isan oxbow lake (dark shades) and highs (light shades) according to the National Elevation 
that formed via neck cut-off of the lower Mississippi River in about ap Dataset”. 


1500”° and is situated outside the modern floodway. Bathymetric contours 
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Extended Data Figure 4 | Radiography, bulk geochemistry, grain size and chronology of core MRY2. The age-depth model at right shows the median 
age probability (black line) and 1c confidence intervals (grey shading), with 2c confidence intervals on individual chronological controls. 
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Extended Data Figure 5 | Radiography, bulk geochemistry, grain size and chronology of core FLR1. The age-depth model at right shows the median 
age probability (black line) and 1o confidence intervals (grey shading), with 20 confidence intervals on individual chronological controls. 
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Extended Data Figure 6 | Radiography, bulk geochemistry, grain size and chronology of core STJ1. The age-depth model at right shows the median 
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age probability (black line) and 1c confidence intervals (grey shading), with 2c confidence intervals on individual chronological controls. 
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Extended Data Figure 7 | Relationships between peak annual discharge 
and normalized EM score for historical floods in sedimentary archives. 
Scatterplots and linear regressions with 1 prediction intervals relating 
normalized EM score (a measure of grain size) to peak annual discharge 
of historical flood events for (a) MRY, (b) FLR and (c) STJ. Peak annual 
discharge estimates are from the Mississippi River gauging station 

at Vicksburg. Calibration periods vary owing to site-specific factors 
discussed in the Methods and Supplementary Information. adj., adjusted. 
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Shifts in tree functional composition amplify the 
response of forest biomass to climate 


Tao Zhang!, Ulo Niinemets?, Justin Sheffield? + & Jeremy W. Lichstein! 


Forests have a key role in global ecosystems, hosting much of 
the world’s terrestrial biodiversity and acting as a net sink for 
atmospheric carbon!. These and other ecosystem services that are 
provided by forests may be sensitive to climate change as well as 
climate variability on shorter time scales (for example, annual to 
decadal)?-*. Previous studies have documented responses of forest 
ecosystems to climate change and climate variability”*, including 
drought-induced increases in tree mortality rates’. However, 
relationships between forest biomass, tree species composition and 
climate variability have not been quantified across a large region 
using systematically sampled data. Here we use systematic forest 
inventories from the 1980s and 2000s across the eastern USA to 
show that forest biomass responds to decadal-scale changes in water 
deficit, and that this biomass response is amplified by concurrent 
changes in community-mean drought tolerance, a functionally 
important aspect of tree species composition. The amplification 
of the direct effects of water stress on biomass occurs because 
water stress tends to induce a shift in tree species composition 
towards species that are more tolerant to drought but are slower 
growing. These results demonstrate concurrent changes in forest 
species composition and biomass carbon storage across a large, 
systematically sampled region, and highlight the potential for 
climate-induced changes in forest ecosystems across the world, 
resulting from both direct effects of climate on forest biomass and 
indirect effects mediated by shifts in species composition. 

How forests respond to climate variability has important implications 
for their future provisioning of ecosystem services, including carbon 
storage, timber, wildlife habitats and regulation of the hydrological 
cycle’”, Long-term demographic and geographic responses of tree 
species to climate change have been documented®’, and numerous 
studies have reported the effects of drought and other extreme events on 
tree growth and mortality”->. However, we have limited understanding 
of how the functional composition of tree communities responds to 
climate variability. It is not known how fast functional shifts occur; for 
example, whether one or two decades of relatively dry or wet conditions 
are sufficient to shift tree communities towards more or less drought- 
tolerant species. It is also unclear how such shifts affect the response 
of forest biomass (a key component of terrestrial carbon storage) 
to climate variability; that is, whether functional shifts moderate 
drought-induced biomass loss (as suggested by ecosystem model 
projections that demonstrate how such shifts can increase forest resilience 
to climate change®) or whether functional shifts amplify drought- 
induced biomass loss (for example, if competition under water-limited 
conditions drives plants to invest more in roots than is optimal for 
biomass production’). 

The eastern USA has experienced substantial climate varia- 
bility over recent decades, with some areas becoming wetter and 
others drier'® (Fig. 1a and Extended Data Fig. la-d). This variability— 
combined with systematically sampled Forest Inventory and Analysis'! 
(FIA) data spanning millions of individual-tree records over several 


decades—provides a valuable opportunity to quantify climate-induced 
changes in forest biomass and species composition. Forests of the east- 
ern USA are influenced by various factors, including regrowth and 
successional dynamics after agricultural abandonment and logging”, 
fire suppression’, introduced pathogens and insects", increases in 
deer (Odocoileus virginianus) populations’ and natural disturbances"*. 
Despite the importance of these non-climatic factors, correlations 
between precipitation change and recent shifts in the abundances and 
geographic ranges of tree species'” suggest that climate variability has 
a tangible effect on forest dynamics in the eastern USA, as observed 
elsewhere!*®. However, there is limited understanding of how shifts in 
forest functional composition affect the response of ecosystem prop- 
erties, such as forest biomass, to climate variability. 

To gain insights into how functional shifts affect the responses of 
ecosystems to climate variability, we quantified relationships among 
tree functional composition, forest biomass and water availability. We 
used the Palmer drought severity index (PDSI, an index of soil moisture 
based on the balance between precipitation and modelled evapotran- 
spiration and run-off’’) to quantify changes in mean growing season 
water availability (APDSI) from the 1980s to 2000s in 1° latitude x 1° 
longitude grid cells (Fig. la). A one-unit decrease in growing-season 
PDSI is equivalent, on average, to a 23% reduction in growing-season 
precipitation in the eastern USA (Extended Data Fig. 1i-k). To quantify 
changes in community-mean drought tolerance (ADT, a functional 
component of tree species composition) and above-ground biomass 
(AAGB; Fig. 1 and Extended Data Figs 2, 3), we used FIA data while 
controlling for changes in forest stand age due to recovery from previ- 
ous disturbance. Specifically, ADT and AAGB were calculated for each 
grid cell by comparing inventory plots in a given age class in the 1980s 
to plots in the same age class in the 2000s. ADT was calculated by 
combining FIA data in each decade, grid cell and age class with a 
species-specific drought-tolerance ranking (DT”°; Supplementary 
Methods 1), which increases from 1 (very intolerant) to 5 (very 
tolerant); therefore, ADT > 0 indicates a shift in tree species composi- 
tion towards a more drought-tolerant community (see examples in 
Supplementary Methods 2). Although the DT scale is arbitrary, such 
rankings are widely used by plant ecologists and foresters!®!””', and 
modifying the scale would not qualitatively affect our results 
(Supplementary Methods 1). DT has been estimated for most of the 
common tree species in the eastern USA”®, enabling a systematic, 
community-level analysis of around 3,000,000 individual-tree records 
from around 100,000 FIA plot inventories (Extended Data Fig. 4a). 

Relationships among ADT, AAGB and APDSI were statistically 
significant (P < 0.05) in most of the 15 cases (3 pairwise correlations x 5 
age classes). Specifically, from the 1980s to the 2000s, community-mean 
drought tolerance tended to increase with increasing water stress 
(negative correlation between ADT and APDSI; Fig. 2a), biomass 
tended to increase with water availability (positive correlation between 
AAGB and APDSI; Fig. 2b) and biomass tended to decrease with 
increasing community-mean drought tolerance (negative correlation 
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Figure 1 | Changes in growing-season Palmer drought severity index, 
community-mean drought tolerance and above-ground biomass 
between the 1980s and 2000s in the eastern USA. a, APDSI. b, ADT. 

c, AAGB. ADT and AAGB were calculated by comparing forest inventory 
plots that were 60-80 years old in the 1980s to plots in the same age class 


between AAGB and ADT; Fig. 2c). These correlations suggest that 
there are concerted changes in forest functional composition and bio- 
mass in response to changes in water availability. 

To determine whether the above correlations (Fig. 2) were robust, 
we performed additional analyses to quantify relationships among 
ADT, AAGB and APDSI while controlling for changes in forest stand 
age and other potentially confounding variables (length of growing 
season, community-mean shade tolerance, tree harvesting, and/or spatial 
autocorrelation that may arise from factors not included in our statistical 
models; Extended Data Figs 5, 6 and Supplementary Methods 3). 
We used three statistical methods (spatial autoregressive models, struc- 
tural equation modelling and independent effects analysis), all of which 
yielded consistent results (compare Extended Data Figs 4, 6 
and 7) and demonstrate that the correlations (Fig. 2) are qualitatively 
robust. The analyses also revealed variation among forest age classes in 
terms of the relative importance of different variables affecting ADT 
and AAGB (Extended Data Fig. 7), highlighting the need to study forest 
dynamics across successional stages and across life stages (for example, 
seedlings versus adults). However, in all age classes, ADT hada strong 
negative correlation with changes in community-mean shade tolerance 
(AST; Extended Data Fig. 7d), as expected from the interspecific trade- 
off between tolerances to shade and drought”. Shifts in ST within forest 
age classes may reflect changing disturbance regimes! or may be 
simply a consequence of PDSI-induced changes in DT. Although our 
methods cannot determine whether changes in ST are a cause or a 
consequence of changes in DT, our analyses demonstrate significant 
relationships among ADT, AAGB and APDSI that are independent 
of AST and other potentially confounding variables. These results are 
further corroborated by a stand-level analysis that tracks the dynamics 
of individual FIA plots that have been measured and remeasured since 
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in the 2000s (other age classes in Extended Data Figs 2, 3). Colour scales 
are oriented so that red corresponds to decreased moisture (in PDSI units; 
Upps1), increased DT (in DT units; Upr) and decreased AGB. White grid 
cells lack sufficient inventory data. 


the 1990s (Supplementary Methods 4). This stand-level analysis shows 
that the qualitative patterns that emerge at the 1° grid-cell scale over 
two decades are also detectable at the stand level over shorter time 
periods (five-year mean remeasurement interval; Extended Data 
Fig. 4d, e). 

The effects of PDSI on AGB in some grid cells were of a similar 
magnitude to the effects of other important global change drivers on 
AGB. For example, compared to the case in which there is no change 
in PDSI, a decrease of two PDSI units (which was exceeded in 11% of 
grid cells; Extended Data Fig. 1c, d) would cause a reduction in AGB of 
8.6-14.3 Mg ha"! over two decades (based on the slopes in Fig. 2b), or 
7-19% of the mean AGB in different forest age classes (Extended Data 
Fig. 3). This response of AGB (equivalent to 0.21-0.36 Mg C ha“ yr~') 
is similar to the estimated effects of nitrogen deposition on AGB in the 
USA” (0.12-0.37 Mg C ha™' yr~'), as well as the AGB component of 
the USA forest carbon sink” (0.30-0.46 and 0.37-0.56 Mg Cha“! yr“, 
for the 1990-1999 and 2000-2007 periods, respectively), which reflects 
the combined effects of several drivers (for example, forest regrowth, 
nitrogen deposition and CO; fertilization). Our analyses suggest that 
in the absence of climate variability, the eastern- USA carbon sink 
would have been even stronger over recent decades, because the mean 
APDSl1 across the eastern USA was —0.61 (Extended Data Fig. 1d), 
which suggests a weakening of the eastern-USA forest carbon sink by 
approximately 0.07-0.11 Mg Cha™! yr7!. 

The response of AGB to PDSI includes not only direct effects of 
APDSI (that is, changes in AGB that would occur in the absence of 
changes in community-mean drought tolerance, DT), but also indirect 
effects of APDSI (that is, changes in AGB caused by PDSI-induced 
changes in DT; Extended Data Fig. 8a, b). Our analysis reveals that 
direct and indirect effects both work in the same direction, which 
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Figure 2 | Relationships between changes in community-mean drought 
tolerance, water availability and above-ground biomass within forest 
age classes from the 1980s to 2000s. a—c, Relationships between ADT and 
APDSI (a), AAGB and APDSI (b) and AAGB and ADT (c). Each point 
represents a forest age class within a 1° grid cell. Lines are ordinary least 
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squares regressions. P values (two-sided) for regression slopes are for the 
following age classes (from top to bottom): 0-20, 20-40, 40-60, 60-80 and 
80-100 years. Pearson correlation ranges are 0.11-0.16 (a), 0.22-0.30 (b) 
and 0.24-0.40 (c). With outliers removed (not shown), all correlations 
have P< 0.05. Sample sizes are in Extended Data Fig. 4a. 
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Figure 3 | Changes in species composition amplify the response of 
biomass to climate variability. a-c, AAGB versus APDSI relationships 
for the three forest age classes with a significant response of ADT to 
APDSI: 40-60 years (a), 60-80 years (b) and 80-100 years (c). Dashed 
lines show direct effects of APDSI on AAGB (controlling for ADT); solid 


means that changes in species composition (indirect effects) amplify 
the response of AGB to PDSI (Fig. 3). Indirect effects accounted for 
roughly 20-30% of the total response of AAGB to APDST in 40- to 
100-year-old forests, significantly amplifying AGB responses in these 
age classes (Extended Data Fig. 8c, d). 

The amplification of the biomass response to climate variability (Fig. 3) 
is probably driven by competitive shifts in the traits of the dominant 
tree species. For example, as water availability declines, competition 
may favour tree species that allocate more carbon to fine roots and 
less to leaves and wood than is optimal for biomass production’. This 
shift in allocation amplifies the response of the ecosystem to decreasing 
water availability, because the most competitive species under dry con- 
ditions have lower biomass than other species that could have persisted 
were they not outcompeted. 

Shifts in community composition towards more- or less-drought- 
tolerant species may be caused by changes in relative species rankings 
in one or more demographic parameters (mortality, growth and recruit- 
ment), and are not necessarily caused by die-offs due to physiological 
stress. We decomposed stand-level ADT into different components 
using data from remeasured inventory plots (Supplementary 
Methods 4, 5). Mortality, growth and recruitment all contributed sub- 
stantially to ADT (Fig. 4a), with the vast majority of ADT accounted 
for by shifts in the abundance of species present at both measurement 
times (as opposed to species additions or losses; Fig. 4b). Mortality and 
species loss were most important in old stands, and recruitment and 


a ADT <0 ADT > 0 b ADT <0 ADT >0 
a> 
2 oot g 
= - 
as 
Os H c= 
5 z s H Age class (yt) ( = 
6 6 0-20 ig 
aa ; |_| 20-40 
65 : 40-60 sy 
os 60-80 3 
£ = 80-100 
100 50 0 50 100 100 50 0 50 100 


Component contribution (%) 


Figure 4 | Contributions of the different components to the change in 
community-mean drought tolerance. Component contributions are 
shown to the left of zero for negative ADT (lower quartile of ADT 
distribution), and to the right of zero for positive ADT (upper quartile). 
Error bars show one side of each 95% confidence interval. a, Contributions 
of mortality, growth and recruitment to ADT. b, Contributions of species 
additions, losses and abundance shifts (‘remain’) to ADT. Sample sizes 
(number of remeasured plots within each quartile) are 300 (0-20 years), 
588 (20-40 years), 1,173 (40-60 years), 1,390 (60-80 years) and 650 
(80-100 years). See Supplementary Methods 5 for details. 


lines show total (direct + indirect) effects (including ADT; Extended Data 
Fig. 8). Lines are labelled with slopes (Mg ha! Upps_'). P values are based 
on s.e.m. of slope differences; n is number of grid cells. The x axes span the 
mean + 2 s.d. from Fig. la. 


species additions were most important in young stands (Fig. 4). 
These age-dependent trends probably reflect the larger size but lower 
density of trees in older forests; for example, the death of a single large 
tree can substantially affect DT (a size-weighted community average), 
whereas juvenile recruitment would have little immediate effect on DT 
in a mature forest. 

Consistent with the importance of abundance shifts at the stand- 
level, grid-cell-scale relationships among ADT, AAGB and APDSI 
reflect the collective responses of many species, without major changes 
in regional species diversity. Analysis of the influence of individual 
species on the changes in community-level DT and AGB revealed no 
systematic differences between angiosperms and gymnosperms 
(Extended Data Fig. 9), and no systematic relationships between 
species influence and either shade tolerance or wood density 
(Supplementary Methods 6, 7 and Supplementary Tables 1-5). Thus, 
the ADT and AAGB responses reflect the collective responses of both 
angiosperm and gymnosperm species spanning a wide range of eco- 
logical strategies. Species abundance distributions and the number of 
common species within sub-regions (north-central, northeastern and 
southeastern USA) were similar between the 1980s and 2000s 
(Extended Data Fig. 10) despite substantial changes in the abundance 
of some individual species (Supplementary Tables 6-20). Some of 
these species-level changes were inconsistent with the overall 
community-level ADT and AAGB responses, and may reflect long- 
term changes in disturbance regimes rather than climate responses’? 
For example, increases in abundance of two widespread maple species, 
Acer rubrum and Acer saccharum, are probably due to fire suppression 
and the resulting mesophication of eastern USA forests!?, rather 
than a response to APDSI. Thus, our study does not indicate that 
climate-driven functional shifts are the primary mode of change in 
eastern USA forests. Nevertheless, our results support recent evidence 
that climate-induced shifts in species abundances and geographic 
ranges have occurred over recent decades in the eastern USA", and 
here we demonstrate for the first time that these shifts have important 
consequences at the ecosystem level. 

The response of eastern USA forests to climate variability, evident 
from shifts in both forest biomass and species composition, is of global 
importance for several reasons. First, these shifts—quantified from 
systematic, regional-scale forest inventories—suggest that even greater 
changes may be underway in other regions where recent drought and 
climate change have been more severe than in the eastern USA; for 
example, in Amazonia and western North America, where drought- 
induced increases in tree mortality have already been reported***”>. 
Second, projected increases in the frequency and severity of extreme 
weather events in many regions of the world”®—combined with the 
sensitivity of forest biomass and species composition to climate varia- 
bility documented here and elsewhere*’—suggests potential changes 
in global forests over the next century that are both ecologically and 
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economically important. Finally, amplification of the biomass—climate 
response due to shifts in species composition (temporal beta diver- 
sity") contrasts with evidence that local (alpha) diversity increases eco- 
system stability’’, including increased resistance to climate extremes*”, 
These contrasting effects of alpha and beta diversity highlight the need 
to better understand how different components of biodiversity, includ- 
ing changes in species composition, affect ecosystem functioning at 
different spatial and temporal scales. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Overview. We used simple correlations and several multivariate methods to quan- 
tify relationships among changes in community mean drought-tolerance (ADT), 
above-ground forest biomass (AAGB) and growing-season water availability 
(APDSI) in the contiguous USA east of 95° longitude. Most analyses were executed 
at the 1° latitude x 1° longitude grid-cell scale and controlled for changes in stand 
age by comparing forest inventory plots in the 1980s to plots in the same grid cell 
and age class in the 2000s. The following sections describe these grid-level 
analyses. Stand-level analyses that tracked the dynamics of remeasured inventory 
plots are described in Supplementary Methods 4, 5. In addition to controlling for 
changes in stand age, our multivariate analyses also controlled for changes in 
community-mean shade-tolerance (AST, which could be confounded with ADT 
owing to the negative correlation between shade- and drought-tolerance”’) and 
growing-season length (AGSL, which could be confounded with APDSI, since both 
depend on temperature). We further controlled for potentially confounding effects 
of unmeasured, spatially structured variables using simultaneous autoregressive 
(SAR) models*!*?. To evaluate the robustness of the SAR model results, we used 
two additional multivariate methods: structural equation modelling? (SEM) and 
independent effects analysis*4 (IEA). 

Quantifying decadal climate variability. We used the change in the PDSI during 
the growing season from the 1980s to 2000s to quantify decadal climate variability. 
The growing season was defined as the period in a given year between the last 
frost (minimum daily temperature below 0°C) in the spring and the first frost 
in the fall. We identified the growing season in each grid cell each year using 
temperature information in the 1° spatial, three-hourly, global climate reanalysis 
dataset (http://hydrology.princeton.edu/data.pgf:php) from ref. 35. Mean growing- 
season PDSI was calculated for each grid cell in each year as the weighted mean of 
monthly PDSI, in which the weights are the number of days in a given month that 
were included in the growing season. For example, if the growing season in a given 
grid cell and year began on May 20 and ended on September 10, then the weights 
for May, June, July, August and September, respectively, would be 12, 30, 31, 31 
and 10. Monthly PDSI in each grid cell was calculated as the median of 16 versions 
of a global (1° latitude x 1° longitude) PDSI dataset (http://hydrology.princeton. 
edu/data.pdsi.php) from ref. 19. The 16 PDSI versions are derived from the 16 
combinations of 4 global precipitation datasets, 2 PDSI models and 2 calibration 
methods (see ref. 19 for details). Preliminary analyses showed that our results were 
qualitatively robust to variation among the 16 PDSI versions. 

After calculating the mean growing-season PDSI for each grid cell in each year, 

the PDSI change (APDSI) between the 1980s and the 2000s was quantified as 
APDSI= PDSlhoo0s = PDSlyog0s, in which PDSlhog0s and PDSlIho00s are the means 
of the growing season PDSI during the two decades (1980-1989 and 2000-2009, 
respectively). Similarly, the change in growing season length (AGSL) between the 
two decades was quantified as AGSL = GSLz2990s — GSLigg055 in which GSLj9g05 and 
GSLz000s are the GSL means in the 1980s and 2000s, respectively. 
Description of the US Forest Inventory and Analysis database. Tree species com- 
position and biomass in each grid cell were quantified from tree-level data reported 
in the US Forest Inventory Analysis (FIA) database (http://www.fia.fs.fed.us/), 
which reports the diameter at breast height (DBH) and species identity of indi- 
vidual trees in forest inventory plots that are geographically distributed to provide 
unbiased estimates of forest biomass and tree population attributes!!. There is 
roughly one plot per 25 km’, and each plot samples an area of approximately 0.1 
ha. Since FIAs national standardized design was implemented (nominally in the 
year 1999, although the exact year varies by US state), plots in the eastern USA have 
been remeasured roughly every 5 years as follows: trees with DBH > 12.7 cm are 
inventoried on four 7.32-m radius subplots per plot, and trees with DBH between 
2.54 and 12.7 cm are inventoried on four 2.07-m radius microplots per plot. Prior 
to around 1999, sampling designs and remeasurement intervals varied among 
US states, but all state inventories aimed to yield unbiased population-level sta- 
tistics!!. Most eastern USA states were inventoried in both the 1980s and 2000s, 
although the same plot locations were not always resampled. Some states were 
also inventoried in the 1990s, but we focused our analysis on changes from the 
1980s to 2000s owing to inconsistent data availability for the 1990s. We assigned 
each FIA plot to a 1° grid cell based on approximate plot coordinates reported by 
FIA. Errors in plot coordinates reported in the public FIA database (see details in 
ref. 36) should have minimal effect on our 1° grid-cell-scale analyses. Owing to 
frequent discontinuities in sampling designs and/or plot locations, it is often not 
possible to track the dynamics of individual plots between the pre- and post-1999 
periods. Furthermore, the unique plot identifiers needed to connect plot records 
across these periods are not provided in the public FIA dataset. However, it is 
possible to track the dynamics of individual plots during the post-1999 period, 
and we used plots remeasured during this period to perform stand-level analyses 
(Supplementary Methods 4). 
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All analyses presented in our paper are based on FIA plots in naturally regen- 
erated (non-plantation) forest. However, many non-plantation forests in the 
USA are managed for timber or other wood products, with management regimes 
ranging from infrequent selective harvest to periodic clear-cutting. Our analyses 
account for stand-replacing disturbance (including intensive logging) by separately 
analysing different stand-age classes. Also, unless stated otherwise, our analyses 
minimized the effects of selective harvest by excluding data from FIA plot inven- 
tories where one or more harvested trees were reported. However, including selec- 
tively harvested plots and accounting for harvest effects does not qualitatively affect 
our results (Supplementary Methods 3 and Extended Data Fig. 6q). 

Stand-age classes. To control for age-related changes in tree species composi- 
tion and biomass, we controlled for changes in stand age in our grid-cell-scale 
analyses by comparing forest inventory plots in the 1980s to plots in the same 
grid cell and age class in the 2000s. We assigned each plot inventory to one of 
the following age classes based on stand ages reported by FIA (defined as the 
mean age of trees in the dominant size class*”): 0-20, 20-40, 40-60, 60-80 or 
80-100 years. The width of each age class (20 years) matches the timescale of 
the change analysis (1980s to 2000s). Therefore, plot locations that were meas- 
ured in both decades were typically assigned to different age classes in the 1980s 
and 2000s; for example, a plot that was in the 40-60 year age-class in the 1980s 
was typically in the 60-80 year age-class in the 2000s. Thus, our grid-cell-scale 
analysis of change from the 1980s to 2000s involves the comparison of two 
largely independent samples (that is, two different sets of plots with similar mean 
age), which minimizes the potential for successional dynamics to influence the 
results. Grid-cell-scale analyses for a given age class included only those grid cells 
for which at least five FIA plots in both the 1980s and 2000s met our filtering 
criteria; for example, plots without recent selective harvest and where species with 
unknown DT comprised <20% of plot AGB (see ‘Quantifying decadal changes 
in forest tree species composition). 

Quantifying decadal changes in forest tree species composition. For each 
age class within each grid cell and decade (1980s and 2000s), we quantified 
community-mean drought tolerance (DT) based on the size and species identity 
of live trees in FIA plots as DT = LI AGBDT: in which n is the number of individual 


i=1 7 
trees; DT; is the drought tolerance index for the species of individual i (the DT 
index is described in detail and evaluated with independent data in Supplementary 
Methods 1); and AGB; is the estimated above-ground biomass density (Mg ha!) 
of individual i. To estimate AGB;, we combined DBH measurements reported by 
FIA with biomass allometries derived for USA tree species groups** to estimate the 
above-ground biomass (Mg) of tree i, which was then converted to Mg ha units 
by multiplying biomass (Mg) by the number of trees per hectare (ha™) represented 
by tree i. These ha“! values are reported by FIA as trees per acre, which is the inverse 
of the sample area for tree i; for example, in the national standardized plot design", 
the sample area for a tree with DBH >12.7 cm is the number of subplots per plot 
(four) x the subplot area (0.0168 ha). For each grid cell, the change in community- 
mean drought tolerance was then calculated as ADT = DTso00; — DThogos- 

The DT index was unavailable for some eastern USA tree species (which 
accounted for 0.10% of the individuals in our analysis), and some individuals were 
identified only to the genus level in the FIA database (2.1% of individuals in our 
analysis). Individuals identified only to the genus level were split into expected 
contributions of different species as follows: if any congeneric individuals present 
in the same grid cell and age class (in either the 1980s or 2000s) were identified 
to species and belonged to a species with known DT, the ‘trees per hectare’ value 
(see above) of the genus-level individual was divided into different known-DT 
species in proportion to their relative abundances (based on AGB) ina given grid 
cell and age class (in the 1980s and 2000s combined). The above algorithm allowed 
us to assign DT values for most individuals with missing DT. We restricted our 
subsequent analyses to FIA plots for which individuals with unassigned DT (after 
splitting genus-level identifications to known-DT species as explained above) com- 
prised less than 20% of plot AGB. The excluded plots accounted for only 0.24% 
and 0.16% of total plots in the 1980s and 2000s, respectively. 

Similarly, we calculated community-mean shade tolerance (ST) and its change 
(AST) based on a species-specific shade-tolerance index (ST) from ref. 20, rang- 
ing from 1 (very intolerant) to 5 (very tolerant). As noted above, DT was unavai- 
lable for some species, and ST was unavailable for these same species. For 
individuals for which we replaced missing DT with assigned values, we also 
replaced missing ST using the same procedure as described above. We then 


i=1 AGB(ST;, 


calculated ST for each grid cell, age class and decade as ST = x ; and we 


DIL AGB; 
calculated its change as AST = SThoo0s — STiogos. 
To evaluate whether our main results depended on using AGB weights to cal- 
culate DT and ST, we repeated our analyses using basal-area weights (proportional 
to tree diameter squared), which yielded very similar results. 
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Quantifying decadal changes in forest biomass. We estimated AGB (Mg ha ') 
for individual trees as described in the previous section. We then summed these 
individual AGB values to estimate plot-level AGB (Mg ha ') for each FIA plot, and 
we calculated average AGB across plots within each grid cell, decade and age class. 
The change in AGB density from the 1980s to 2000s was then calculated for each 
age class within each grid cell as AAGB = AGB 2090s — AGByog05. 

Quantifying the response of community-mean drought tolerance and biomass 
to climate variability using spatial regression models. We implemented SAR 
models for each age class (see subsection ‘Stand-age classes’) with the spautolm 
function in the ‘spdep’ package*’ in R (https://www.r-project.org). We used the 
following SAR model to quantify the response of community-mean drought 
tolerance (ADT) to climate variability (APDSI) while controlling for changes in 
growing season length (AGSL) and community-mean shade-tolerance (AST): 


ADT = dy) 1+ dp APDSI + dg AGSL+ ds AST + u (1) 


in which bold terms are vectors of length n (the number of grid cells included in 
the analysis; 1 is a vector of 1 s of length 1); do is the intercept; dp, dg, and dg are the 
slopes associated with the corresponding predictors; and u is a vector of spatially 
autocorrelated errors. This vector is defined asu= AWu + e, in which W is the 
row-standardized n x n spatial weights matrix (based on inverse distances to the 
8 neighbour grid cells); € are spatially independent errors; and 4 is the spatial 
autoregressive coefficient. 

We used an analogous SAR model to quantify the response of above-ground 
biomass (AAGB) to climate variability (APDSI) and shifts in community-mean 
drought tolerance (ADT): 


AAGB = ay 1+ ap APDSI+ ap ADT + ag AGSL+as AST+u = (2) 


in which ap is the intercept; ap, ap, ag and ag are the slopes associated with the 
corresponding predictors; and u is a vector of spatially autocorrelated errors as 
described above. 

We assumed linear forms for regression models because Pearson linear and 
Spearman rank correlations yielded very similar results for bivariate relationships 
among ADT, AAGB and APDSI (Fig. 2), and because partial regression plots 
from multiple regression models did not reveal strong or consistent nonlinearities. 
Partitioning variation in ADT and AAGB using structural equation modelling 
and independent effects analysis. To evaluate the robustness of our SAR model 
results (that is, the effects of APDSI on ADT, and the effects of APDSI and ADT 
on AAGB), we used two additional methods—SEM* and IEA*4—to quantify 
relationships between response and predictor variables. 

SEM is based on networks of hypothesized causal relationships among variables 
and is designed to disentangle potential causal pathways in multivariate datasets*’. 
The SEM model structures we evaluated are illustrated in Extended Data Fig. 7a-c. 
We implemented SEM with the sem function in the ‘lavaan’ package” in R. 

IEA compares models fit with all possible contributions of predictor variables 

to quantify the contribution of each variable to goodness-of-fit™. For example, if 
the improvement of goodness-of-fit due to including a given variable x is always 
greater than the improvement due to including any other variable, then variable 
x would be identified by IEA as having the greatest contribution to explaining 
variance in the response variable. IEA was applied to non-spatial forms of the 
regression models described in the previous subsection using the hier.part function 
in the ‘hier.part’ package in R. 
Quantifying direct and indirect effects of APDSI on AAGB and the amplifying 
effect of ADT. We used both SAR and SEM approach to quantify ‘direct’ effects 
of APDSI on AAGB (for example, due to tree-level physiological processes) versus 
‘indirect’ effects that occur because of changes in species composition. Direct and 
indirect effects are illustrated in Extended Data Fig. 8a, b. To quantify these effects 
in SAR analysis, we combined equations (1) and (2): 


AAGB = ap 1+ ap APDSI + ap ADT 
= ay 1+ ap APDSI + ap(dp 1+ dp APDSI) 


Therefore: AAGB = (ap + ap - do) 1+ (ap + ap: dp) APDSI (3) 


Thus, the total effect of APDSI on AAGB is ap + ap - dp, the direct effect is ap 
(which is simply the partial regression coefficient for the effect of APDSI on 
AAGB), and the indirect effect is ap - dp; that is, the effect of APDSI on AAGB 
due to the combined effects of ADT on AAGB (ap) and APDSI on ADT (dp). 
Terms for growing season length (AGSL) and community-mean shade tolerance 


(AST) are omitted from equation (3) because we control for these sources of 
variation here by setting AGSL= AST =0 to isolate the effects of APDSI and 
ADT. We also omit the error terms, as we rely on the previously fit SAR models 
(equations (1) and (2)) for our analysis of equation (3). 

The estimated effect of ADT on the slope of AAGB versus APDSI is amplify- 
ing (rather than moderating) because estimates for ap and dp are both negative 
(Extended Data Fig. 8c). Thus, their product is positive, which increases the steep- 
ness of the AAGB versus APDSI slope (equation (3)). To formally test the hypo- 
thesis that ap - dp is greater than zero, we used two different approaches, which 
both confirmed that ap - dp is significantly greater than zero. The first approach 
tested the hypothesis (that is, ap - dp > 0) analytically by calculating the estimates 


and standard errors of ap - dp, which are dp - dp and ap - dp, (= : ae 3) : 
D 

respectively. Estimates of ap and their standard errors (ap and o,,,) are available 
from the SAR model for AAGB (equation (2)); and estimates of dp and 
their standard errors (dp and og,) are available from the SAR model for ADT 
(equation (1)). P values from this first approach are reported in Fig. 3. The second 
approach yielded very similar P values and involved error propagation using Monte 
Carlo simulations, in which we sampled from the approximately multivariate- 
normal parameter distributions from the two separate models (equations (1) and 
(2)) to generate a distribution for the product ap - dp. 

Comparison of PDSI effects on AGB with other global change drivers. We com- 
pared AGB responses in our study (based on the range of slopes in Fig. 2b) to the 
effects of nitrogen deposition” and the mean USA forest carbon sink”. All effects 
were expressed in AGB units of Mg C ha“! yr~!, assuming a 2:1 AGB:C ratio. 

Over recent decades, nitrogen deposition has increased carbon uptake in USA 
forests by an estimated 24-65 kg C kg N“!, which includes 5kg C kg Nin soil 
carbon and a multiplier of 1.2 to account for root biomass””. Removing the soil 
and root components yields the range (24—5)/1.2 to (65—5)/1.2, or 15.8-50kg 
Ckg N!. Estimates of USA forest area (3.1 x 10° ha) and nitrogen deposition 
(2.3 Tg N yr~!) from ref. 22 yield a deposition rate of 7.42kg N ha’! yr_l. 
Multiplying this rate by the effect (15.8-50 kg C kg N~!) yields an estimated AGB 
increase of 0.12-0.37 Mg Cha! yr7!. 

The USA forest carbon sink estimates are 0.72 Mg C ha”! yr“! (for the period 

1990-1999; 66% in biomass) and 0.94Mg C ha! yr! (for the period 2000-2007 
period; 62% in biomass), based on table 2 of ref. 23. Assuming that 80% of biomass 
is AGB** yields 0.72 x 66% x 80% (1990-1999) and 0.94 x 62% x 80% (2000-2007) 
Mg Cha‘! yr~1. Applying +20% uncertainty (which is the estimated 95% confi- 
dence range for USA forests in ref. 23) to these estimates yields 95% confidence 
intervals of 0.30-0.46 (1990-1999) and 0.37-0.56 (2000-2007) Mg C ha lyr! 
for the AGB component of the USA forest carbon sink. 
Data availability. All data used in our analyses are publicly available. FIA forest 
inventory data are available at https://www.fia.fs.fed.us/. Temperature and pre- 
cipitation data are available at http://hydrology.princeton.edu/data.pgf.php. PDSI 
data are available at http://hydrology.princeton.edu/data.pdsi.php. The originally 
published versions of the DT and ST indices are available at http://www.esapubs. 
org/Archive/mono/M076/020/appendix-A.htm, and updates are described in 
Supplementary Methods 1. The US Census Bureau is the source of the reference 
map (US state boundaries). 
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Extended Data Figure 1 | PDSI and growing-season length in the 1980s 
and 2000s, and relationships between PDSI and precipitation in the 
eastern USA. Maps show 1° latitude x 1° longitude grid cells. a~c, Mean 
growing-season PDSI (see Methods) for the 1980s (a), the 2000s (b) and 
the change in PDSI (APDSI) in PDSI units (Upps) (c) between the two 
decades. d, Smoothed distribution of APDSI; the vertical line is the mean, 
with the mean absolute change (MAC; mean of absolute values) indicated. 
e-g, Mean GSL for the 1980s (e), the 2000s (f) and the change in GSL 
(AGSL, months) (g) between the two decades. h, Smoothed distribution 
of AGSL; the vertical line is the mean, with the MAC indicated. i, The 


response of PDSI to precipitation change. The map shows regression slopes 
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of APDSI; (change in growing-season PDSI between successive years from 
1948 to 2009) versus AP, (change in growing-season precipitation between 
successive years); slope units are PDSI units (Uppsr) per cm precipitation. 
The sample size for each regression (one regression per grid cell) is 61 (the 
number of annual changes from 1948-2009). j, Pearson's correlation (r) 
between APDSI, and AP,; samples sizes as in i. k, The percentage change 
in growing-season precipitation corresponding to a one-unit change in 
PDSI; percentages were calculated as 100 times the inverse of the slope 
from i divided by the mean (1948-2009) growing-season precipitation for 
each grid cell. 
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Extended Data Figure 2 | Community-mean drought tolerance in the 
1980s and 2000s in the eastern USA. DT was calculated within 1° grid 
cells and 20-year stand-age classes that contained at least five FIA 
inventory plots that satisfied filtering criteria (see Methods). From left, the 
columns show values of DT for the 1980s, the 2000s, the change between 
the two decades and the smoothed distributions of changes (vertical lines 


Age class 


Age class 


Age class 


40-60 yr 


Age class 
80-100 yr 


c ADT en Use d_ Age class: 0-20 yr 
-0.4-0.2 0 0.2 04 


, MAC = 0.23 


Q 
4 
wo) 
) 
L=J 
Age class f-) 
0-20 yr 15 05 05 


ADT (Upr) 


g ADT Ust h_ Age class: 20-40 yr 
-0.4-0.2 0 0.2 04 - 
A MAC = 0.21 
re 
od 
wo 
-) 
fo} 
Age class fo) 
nena 0.5 0.0 05 1.0 
ADT (Upr) 
ADT - 40- 
k Uor | Age class: 40-60 yr 


MAC = 0.14 


Age class 
40-60 yr 0.5 00 05 


Age class: 60-80 yr 


MAC = 0.15 


oO 
Age class f) 
60-80 yr 0.4 00 04 08 


ADT (Upr) 


s ADT t Age class: 80-100 yr 


Upr 


MAC = 0.18 


2.0 


1.0 


0.0 


Age class 
80-100 yr 0.5 


__ 05 1.0 
ADT (Upr) 


show mean changes, with MAC indicated). Different stand-age classes are 
shown in each row as follows: a—d, 0-20 years; e-h, 20-40 years; i-l, 40-60 
years; m-p, 60-80 years; q-t, 80-100 years. DT units (Upr) are on the 
scale of the species drought tolerance (DT) index, which increases from 1 
(very intolerant) to 5 (very tolerant); see Methods and Supplementary 
Methods 1 for details. 
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Extended Data Figure 3 | Above-ground live biomass in the 1980s 

and 2000s in the eastern USA. AGB (Mg ha’') was estimated from 
allometries and averaged within 1° grid cells and 20-year stand-age classes 
that contained at least five FIA inventory plots that satisfied filtering 
criteria (see Methods). From left, the columns show values of AGB for the 
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1980s, the 2000s, the change between the two decades and the smoothed 
distributions of changes (vertical lines show mean changes, with MAC 
indicated). Different stand-age classes are shown in each row as follows: 
a-d, 0-20 years; e-h, 20-40 years; i-1, 40-60 years; m-p, 60-80 years; 
q-t, 80-100 years. 
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Extended Data Figure 4 | Sample sizes and slopes for grid-cell and 
stand-scale analyses. a, Summary of plot ages and sample sizes by age 
class for FIA data used in SAR models and other analyses (Figs 2, 3 and 
Extended Data Figs 7, 8). The bottom row of the table refers to the 
number of remeasured inventory plots used in stand-scale analyses (d, e), in 
which individual plots were tracked over time (Supplementary Methods 4). 
All other rows in the table refer to grid-cell-scale analyses that control for 
stand age by comparing plots in a given age class in the 1980s to plots in 
the same grid cell and age class in the 2000s (b, c). The number of 

grid cells varies across age classes because a grid cell was only included 

in the analysis for a given age class if the grid cell included at least five 
plot records that met our filtering criteria in both the 1980s and 2000s 
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(see Methods). b, SAR model slopes quantifying the mean grid-cell-scale 
response of ADT (in DT units, Ups, in which DT increases with drought 
tolerance and ranges from 1 to 5) to APDSI (in PDSI units, Uppg) from 
the 1980s to 2000s. c, SAR model slopes quantifying the mean grid-cell- 
scale response of AAGB (Mg ha™') to APDSI and ADT from the 1980s to 
2000s. d, e, Similar to b, c, but for stand-level SAR models fit to 
remeasured plots (see Supplementary Methods 4 for details). All SAR 
slopes are partial regression coefficients that control for changes in 
growing-season length, changes in community-mean shade tolerance, and 
spatial autocorrelation (see Methods). Error bars are s.e.m. of slopes, with 
P values shown outside of the bars: *P < 0.05; **P<0.01. 
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Extended Data Figure 5 | Community-mean shade tolerance in the 
1980s and 2000s in the eastern USA. ST was calculated within 1° grid 
cells and 20-year stand-age classes that contained at least five FIA 
inventory plots that satisfied filtering criteria (see Methods). From left, the 
columns show values of ST for the 1980s, the 2000s, the change between 
the two decades and the smoothed distributions of changes (vertical lines 
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show mean changes, with MAC indicated). Different stand-age classes are 
shown in each row as follows: a-d, 0-20 years; e-h, 20-40 years; i-l, 40-60 
years; m—p, 60-80 years; q-t, 80-100 years. ST units (Usr) are on the scale 
of the species shade tolerance (ST) index, which increases from 1 (very 
intolerant) to 5 (very tolerant); see Methods. 
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Extended Data Figure 6 | Harvest intensity and mean drought tolerance of row as follows: a-d, i-l, 0-40 years; e-h, mp, 40-100 years. q, Slopes of ADT 
harvested trees in the 1980s and 2000s in the eastern USA. Harvest intensity versus APDSI from four versions of spatial regression models with different 
Ty (per cent AGB harvested per year, units % yr") and DTy (units Upr) were tree- and plot-filtering criteria and different approaches to modelling harvest 
calculated within 1° grid cells and 20-year stand-age classes for analysis effects (see Supplementary Methods 3 for details). Model 1 corresponds to 
(Supplementary Methods 3), but are shown here in aggregated age classes Extended Data Fig. 4b (note that only the three significant age classes from 
(0-40 and 40-100 years) because patterns were similar among 20-year age Extended Data Fig. 4b are presented here: 40-60, 60-80 and 80-100 years). The 
classes within each aggregated class. a-p, From left, the columns show values of similar slopes and significance levels (P values are shown above each bar) from 
Ty (a-h) or DTy (i-p) for the 1980s, the 2000s, the change between the two the four models suggest that the estimated response of ADT to APDSI is robust 
decades, and the smoothed distributions of changes (vertical lines show mean to including or excluding effects of tree harvest. Error bars are s.e.m. of slopes. 


changes, with MAC indicated). Different stand-age classes are shown in each 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


a Indirect 
d. ADT model 
0-20 yr 40-60 yr 80-100 yr 
12 
(eo 001) a sal 
= <0. ee 
i a : ie (<0. 001). 00 a0 — (0.004) 
73* (< 0.001) 
(< 0.001) 4 
APDSI 
MAGSL 
MAST 
40 


IEA 


e. AAGB model 


0-20 yr 20-40 yr 40-60 yr 60-80 yr 80-100 yr 
16 25* Z 
+ (0.020) 33** 
= 48* 13 (0.003) 
WW + (<0.001) 
‘ APDSI 
(0.012) (< 0.001) NM ADT 
™ AGSL 
MAST 
32 


IEA 


23 


Extended Data Figure 7 | Structural equation model and independent 
effects analysis results for the response variables ADT and AAGB. 

a, SEM structure for the ADT model. b, SEM structure for the AAGB 
model. c, SEM structure for the alternative AAGB model that includes 
both direct and indirect effects of APDSI on AAGB (see Methods and 
Extended Data Fig. 8a, b for further explanation of direct and indirect 
effects). d, Per cent contributions of APDSI, AGSL and AST to the 
explained variation in ADT. e, Per cent contributions of APDSI, AGSL, 
AST and ADT to the explained variation in AAGB (SEM results are for 


71 


the model structure shown in b). Direct and indirect effects from c are 
reported in Extended Data Fig. 8d. Both SEM and IEA provide variance- 
partitioning estimates. In addition, SEM provides significance tests for 
explanatory variables. Significant P values (P < 0.05) are shown in the 
parentheses below the corresponding percentage contribution, *P < 0.05; 
**P < 0.01. See Methods for details of SEM and IEA analyses. Positive and 
negative signs in d, e indicate the signs of the SEM and IEA coefficients, 
which are consistent with the signs of SAR model coefficients (Extended 
Data Figs 4 and 8). 
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60-80 -0.038 0.011 -39.40 < 0.001 5.52 0.006 
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d Ageclass SEM SAR 
(yr) Direct coef. (Pvalue) Indirect coef. (P value) Indirect %| Direct coef. (Pvalue) Indirect coef. (P value) Indirect % 


0-20 0.142 (0.052) 0.010 (0.385) 
20-40 0.264 (< 0.001) 0.013 (0.268) 
40-60 0.143 (0.020) 0.058 (0.005) 
60-80 0.179 (0.002) 0.095 (< 0.001) 
80-100] 0.214 (0.003) 0.048 (0.031) 


6.6 0.130 (0.118) 0.014 (0.136) 9.8 

4.7 0.156 (0.053) 0.011 (0.171) 6.7 
28.7 0.108 (0.187) 0.036 (0.032) 25.0 
34.6 0.210 (0.006) 0.057 (0.012) 21.4 
18.3 0.171 (0.068) 0.041 (0.019) 19.5 


Extended Data Figure 8 | Conceptual model and supporting evidence 
for direct and indirect effects of changes in water availability on forest 
biomass. a, Conceptual model of direct (solid arrow) and indirect (dashed 
arrows) effects of decreasing water availability (APDSI <0) on forest 
biomass. b, Conceptual model of direct (solid arrow) and indirect (dashed 
arrows) effects of increasing water availability (APDSI > 0) on forest 
biomass. The conceptual model in a, b is supported by our results (c, d), 
which show that the response of forest biomass to APDSI is amplified by 
indirect effects; for example, if water availability decreases (APDSI < 0), 
then biomass decreases owing to direct effects (for example, physiological 
tree-level responses) as well as indirect effects (shifts in species 
composition towards more drought-tolerant but lower-biomass species). 
c, Slopes and P values from SAR models (equations (1) and (2)) for 


different stand-age classes (1° grid-cell-scale results, as in Extended Data 
Fig. 4b, c). Slopes are partial regression coefficients, so the slope labelled 
‘response of AGB to APDS? is the direct effect of APDSI on AAGB 
(controlling for ADT and other covariates), and the indirect effect is 
estimated by the product of the other two slopes (see Methods). d, Direct 
and indirect effects of APDSI on AGB estimated from SEM (Extended 
Data Fig. 7c) and SAR models (see Methods). Sample sizes in these 
analyses (number of grid cells) are: 171, 247, 271, 271, and 193 for age 
classes from 0-20 to 80-100, respectively (as shown in Extended Data 
Fig. 4a). For consistency with SEM, SAR slopes are standardized in d 
(standard deviation units) but are otherwise equivalent to SAR coefficients 
in cand in Fig. 3. The percentages in d are calculated as 100 x indirect / 
(direct + indirect). 
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Extended Data Figure 9 | Distributions of species influence on 
estimated responses of community-mean drought tolerance (DT) to 
APDSI, AGB to APDSI, and AGB to ADT. Species influence (x axis) is 
the per cent change in SAR model slopes owing to including an individual 
species in the analysis (see Supplementary Methods 6, 7 for details). 
Probability density (y axis) is the relative frequency of species with a given 
per cent influence. Black curves are for all species, and red and blue curves 
are for gymnosperm and angiosperm species, respectively. Vertical lines 
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are means. The distributions show that most species have little influence 
on the SAR slopes, some species have large influence, and gymnosperms 
and angiosperms have similar degrees of influence. From left, the columns 
show species influences on parameter dp from equation (1), and species 
influences on parameters ap and ap from equation (2). Different stand-age 
classes are shown in each row as follows: a—c, 0-20 years; d-f, 20-40 years; 
g-i, 40-60 years; j-I, 60-80 years; m-o, 80-100 years. 
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Extended Data Figure 10 | Species abundance distributions in the 
northeastern, north-central and southeastern USA in the 1980s and 
2000s. a, Map of the three mentioned sub-regions of the USA. b-p, Species 
abundance distributions (number of species in different abundance 
intervals) for each age class (rows) in each sub-region (columns). Blue 

and red bars represent numbers of species in the 1980s and 2000s, 
respectively. Purple indicates overlap of the two bars. Abundance is 
quantified as the percentage of total above-ground biomass comprised by 
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A new class of synthetic retinoid antibiotics 
effective against bacterial persisters 


Wooseong Kim!, Wenpeng Zhu’, Gabriel Lambert Hendricks!, Daria Van Tyne**, Andrew D. Steele*°, Colleen E. Keohane”, 
Nico Fricke’, Annie L. Conery”®, Steven Shen!, Wen Pan!, Kiho Leel, Rajmohan Rajamuthiah', Beth Burgwyn Fuchs’, 
Petia M. Vlahovska’, William M. Wuest>°, Michael S. Gilmore**, Huajian Gao?, Frederick M. Ausubel’* & Eleftherios Mylonakis! 


A challenge in the treatment of Staphylococcus aureus infections 
is the high prevalence of methicillin-resistant S. aureus (MRSA) 
strains and the formation of non-growing, dormant ‘persister’ 
subpopulations that exhibit high levels of tolerance to antibiotics! 
and have a role in chronic or recurrent infections*®. As conventional 
antibiotics are not effective in the treatment of infections caused 
by such bacteria, novel antibacterial therapeutics are urgently 
required. Here we used a Caenorhabditis elegans-MRSA infection 
screen® to identify two synthetic retinoids, CD437 and CD1530, 
which kill both growing and persister MRSA cells by disrupting 
lipid bilayers. CD437 and CD1530 exhibit high killing rates, 
synergism with gentamicin, and a low probability of resistance 
selection. All-atom molecular dynamics simulations demonstrated 
that the ability of retinoids to penetrate and embed in lipid bilayers 
correlates with their bactericidal ability. An analogue of CD437 
was found to retain anti-persister activity and show an improved 
cytotoxicity profile. Both CD437 and this analogue, alone or 
in combination with gentamicin, exhibit considerable efficacy 
in a mouse model of chronic MRSA infection. With further 
development and optimization, synthetic retinoids have the 
potential to become a new class of antimicrobials for the treatment 
of Gram-positive bacterial infections that are currently difficult 
to cure. 

We used an established automated high-throughput C. elegans- 
MRSA killing assay in 384-well plates® to screen approximately 82,000 
small synthetic molecules, and identified 185 compounds that signifi- 
cantly decreased the ability of MRSA to kill the nematodes (Fig. 1a, 
Supplementary Table 1, Supplementary Methods). Two of these 185 
compounds, the synthetic retinoids CD437 and CD1530 (vitamin A 
analogues; Fig. 1b), were selected for further investigation because 
they have similar structures and have been studied previously for their 
therapeutic potential’~'°. 

CD437 and CD1530 exhibit potent in vitro bactericidal activity 
against MRSA strain MW2,; after two hours, levels of MW2 were below 
the limit of detection (minimum inhibitory concentration (MIC) 
lpg ml7|; Fig. 1c, Extended Data Fig. 1a, Supplementary Table 2). 
In vivo, CD437 or CD1530 at concentrations above their in vitro 
MICs protected 100% of C. elegans against MW2-induced death 
(Fig. 1d). CD437 and CD1530 also exhibited potent activity against 
a panel of clinical S. aureus and Enterococcus faecium strains, but 
not against Gram-negative species (Supplementary Table 2). In 
addition, adarotene, a structural analogue of CD437 and CD1530 
and a potential ovarian cancer drug® (Fig. 1b), also exhibited signifi- 
cant anti-staphylococcal activity (MIC 21g ml~') and prevented the 
MRSA~- induced death of C. elegans. However, adapalene, another ana- 
logue and a US Food and Drug Administration (FDA)-approved acne 


therapeutic™, was ineffective against MRSA (Fig. la—d, Extended Data 
Fig. 1a, Supplementary Table 2). 

We were unable to obtain retinoid-resistant mutants by plating 10'° 
colony-forming units (CFU) of S. aureus M“W2 on agar containing 
2.5x,5x or 10x MIC of CD437, CD1530 or adarotene. Similarly, 
serial passage of two independent S. aureus MW2 cultures (SP1 and 
SP2) for 100 days in sub-MIC levels of CD437 yielded only putative 
mutants with twofold greater resistance to CD437, CD1530 or adar- 
otene, whereas serial passage in ciprofloxacin for 100 days (Fig. le) 
or daptomycin for 15 days (Extended Data Fig. 1b) generated strains 
that were 256-fold and tenfold more resistant, respectively. The MW2 
cultures that exhibited modest retinoid resistance contained mutations 
in the genes graS, yjbH and manA (Fig. 1f, Supplementary Tables 3-5, 
Supplementary Discussion), which encode products related to mem- 
brane physiology'*-'®. Consistent with this finding, CD437, CD1530 
and adarotene—but not adapalene—induced membrane permeabi- 
lization in MW2 (monitored by SYTOX Green uptake; Fig. 2a), and 
CD437 and CD1530 caused the formation of mesosome-like structures 
(observed by transmission electron microscopy; Fig. 2b), similar to 
those observed in S. aureus cells after treatment with antimicrobial 
peptides'”. Moreover, CD437, CD1530 and adarotene disrupted the 
integrity of biomembrane-mimicking giant unilamellar vesicles (Fig. 2c, 
Supplementary Videos 1-5). These vesicles consist of a DOPC:DOPG 
lipid bilayer at a ratio of 7:3 (DOPC/G, 1,2-dioleoyl-sn-glycero-3- 
phosphocholine/glycerol), which mimics anionic bacterial membranes, 
and have been used to elucidate the mechanisms of action of dapto- 
mycin in S. aureus!®!9, Notably, however, CD437 and CD1530 did not 
lyse bacterial cells directly (Extended Data Fig. Ic). 

To elucidate the molecular interactions between retinoids and the 
membrane lipid bilayers of S. aureus, we conducted all-atom molecular 
dynamics simulations using a lipid bilayer composed of 108 phosphati- 
dylglycerol lipids, 72 lysyl-phosphatidylglycerol (Lys-PG) lipids and 10 
diphosphatidylglycerol (DPG, also known as cardiolipin) lipids, which 
mimics the phospholipid composition of S. aureus membranes”. These 
simulations showed that the carboxylic acid and the phenolic groups 
of CD437, CD1530 and adarotene anchor these retinoids to the sur- 
face of the membrane bilayer by binding persistently to hydrophilic 
lipid heads. As a result, the retinoids penetrate the bilayers and become 
embedded orthogonally to the lipid molecules in the outer membrane 
leaflet, inducing substantial perturbations of the membrane (Fig. 2d, e, 
Supplementary Videos 6-9). Similar results were obtained for mole- 
cular dynamics simulations of DOPC:DOPG (7:3) lipid bilayers used 
in the giant unilamellar vesicle experiments in Fig. 2c (Extended Data 
Fig. 2a, b, Supplementary Videos 10-13). In contrast to CD437, 
CD1530 and adarotene, adapalene does not penetrate the membrane 
owing to a high energy barrier (11.22 kgT) and an unfavourable 
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Figure 1 | Synthetic retinoids protect C. elegans from MRSA infection 
and inhibit MRSA growth without detectable mutant development. 

a, Images of MRSA-MW2-infected C. elegans in the presence of 10 pg ml“! 
retinoids, 101g ml"! vancomycin, or 1% DMSO as a control (see 
Supplementary Methods). Only dead worms stain with SYTOX Orange. 
Experiments were independently repeated three times with similar results. 
b, Chemical structures of synthetic retinoids. c, Growth of MW2 exposed 
to the five indicated compounds at various concentrations after 18 hours 
in tryptic soy broth. OD¢00; optical density at 600 nm. d, Survival of C. 
elegans infected with MW2 in the presence of retinoids, normalized to C. 


transfer energy (3.16 kgT) (Fig. 2e, Supplementary Table 6), as the 
hydrophobic methoxy group does not bind to lipid heads (Fig. 2d, 
Supplementary Video 9). CD437-like retinoids can be metabolized 
in the liver by glucuronidation at carboxylic or hydroxyl groups”'. 
Molecular dynamics simulations showed that two CD437 glucuron- 
ide metabolites also penetrate into lipid bilayers, exhibiting a similar 
penetration mechanism to that of CD437 (Extended Data Fig. 2a, c, 
Supplementary Videos 14, 15, Supplementary Discussion). In sum- 
mary, these molecular dynamics simulations showed that two polar 
branch groups—a phenol and a carboxylate—have essential roles 
in membrane attachment and penetration, and that the membrane 
activity of retinoids (Fig. 2) directly correlates with their antibiotic 
activity (Fig. 1c, Extended Data Fig. 1a). 

CD437 or CD1530—but not adarotene—also induced rapid permea- 
bilization of MRSA-persister membranes (Extended Data Fig. 3a), and 
killed MRSA-persister cells (Fig. 3a, Extended Data Fig. 3b). They also 
completely eradicated persisters formed by 13 clinical isolates, includ- 
ing the multi-drug resistant strain VRS1 within 1 to 4hours at 8-10x 
MIC (Fig. 3b, Extended Data Fig. 3c, d). Moreover, CD437 or CD1530 
killed around 90% and 100% of persisters formed in MRSA biofilms 
at 16x MIC and 32x MIC, respectively (Extended Data Fig. 4). 
Compared with adarotene, CD437 and CD1530 can penetrate the 
membrane more efficiently owing to lower energy barriers and more 
favourable transfer energies (Fig. 2e); this is consistent with the obser- 
vation that adarotene is inactive against persister cells. The results sug- 
gest that the planar aryl moiety of CD437 and CD1530 (highlighted in 
blue, Fig. 1b) rigidifies the carboxylic acid, which facilitates penetration 
into lipid bilayers, whereas the flexible cinnamoyl moiety of adarotene 
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elegans treated with DMSO. c, d, Individual data points (n = 3 biologically 
independent experiments) and mean + s.d. are shown. e, Appearance 

of spontaneous CD437- and ciprofloxacin-resistant MW2 mutants over 
100 days of serial passage in duplicate (SP1 and SP2) (see Supplementary 
Methods). f, Appearance of mutations on specific days in the indicated 
genes in SP1 and SP2 in e (see Supplementary Methods). The modest 
increase in the MIC of CD437 against MRSA during serial passage was 
confirmed by remeasuring MICs using three colonies from aliquots of 
each passage that had been stored at —80°C. Mutated genes are indicated 
on the day at which the mutations were first detected. 


fails to orient the carboxylate appropriately, thereby decreasing mem- 
brane penetration (Fig. 2e). 

CD437 or CD1530 exhibited significant synergism with gentamicin 
against both MRSA growing and persister cells (Fig. 3c, Supplementary 
Table 7, Extended Data Fig. 4). This is most probably a consequence 
of the increased passive diffusion of gentamicin through the bacterial 
cell membranes that have been physically damaged by the retinoids, 
which is mechanistically distinct from the observed synergism between 
gentamicin and ionophores” (Extended Data Fig. 5, Supplementary 
Discussion). 

Although membrane-targeting agents often cause toxicity in 
mammals”3, CD437, CD1530 and adarotene are relatively non-toxic, 
exhibiting median haemolytic concentrations (HC59) of greater than 
32 yg ml~! (Extended Data Fig. 6a). CD437, CD1530 and adarotene 
were more toxic to human hepatoma HepGz? cells (median lethal 
concentration (LCs9) 3-5 ug ml~') than to normal human primary 
hepatocytes (LCs» > 201g ml~!), or to primary renal proximal tubule 
epithelial cells or adult normal human epidermal keratinocytes at 
8g ml~! (Extended Data Fig. 6b), a concentration at which CD437 
and CD1530 completely eradicated MRSA persisters (Fig. 3a). These 
data are consistent with previous results showing that CD437 exhibits 
selective toxicity towards cancer cells!°. None of the three retinoids 
inhibited the human ether-a-go-go related (hERG) potassium chan- 
nels that are critical for cardiac action potential repolarization at 251M 
(Extended Data Fig. 6c) and did not show significant genotoxic poten- 
tial (Supplementary Table 8). 

To evaluate the effects of the CD437-like retinoid branch groups 
on antimicrobial activity and the possibility of further structural 
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Figure 2 | CD437, CD1530 and adarotene disrupt membrane lipid 
bilayers. a, Uptake of SYTOX Green (Ae. = 485 nm, Aem = 525 nm) by 
exponential-phase S. aureus MW2 cells treated with retinoids. Individual 
data points (n= 3 biologically independent samples) and means are 
shown. Error bars not shown for clarity. b, Transmission electron 
micrographs showing mesosome-like structures (white and red arrows; 
enlarged in bottom images) in 10x MIC retinoid-treated cells and DMSO 
control. Scale bars, 200 nm. c, Changes in giant unilamellar vesicles 
(DOPC:DOPG. 7:3) labelled with 18:1 Liss Rhod PE (0.05%) treated with 
retinoids or with 0.1% DMSO, monitored using fluorescence microscopy 
(40x objective, \-x=460nm, Aem = 483 nm). Liss Rhod PE, 1,2-dioleoyl- 
sn-glycero-3-phosphoethanolamine-N-(lissamine rhodamine B sulfonyl) 
(ammonium salt). In b, c, experiments were independently repeated 


optimization with respect to antimicrobial and toxicity profiles, we syn- 
thesized 16 analogues of CD437 (Extended Data Fig. 7a). Subsequent 
analysis of their structure-activity relationships supported the putative 
mode of action by which the synthetic retinoids disrupt Gram-positive 
bacterial membranes, and demonstrated that the antimicrobial activity 
and cytotoxicity of synthetic retinoids can be modulated by the polar- 
ity of the branch groups (Extended Data Figs 7-9, Supplementary 
Videos 16, 17, Supplementary Discussion). In particular, analogue 2, 
which has a less polar primary alcohol instead of the carboxylic acid 
group, retained bacterial activity against MRSA persisters (Fig. 4a, b), 
but showed significantly less haemolytic activity (HCs9 > 321g ml, 
Extended Data Fig. 8a) and less cytotoxicity in a panel of human cell 
lines (LCs > 31g ml~') than did CD437 (Fig. 4c, Extended Data 
Fig. 6b). Analogue 2 also showed significantly reduced activity towards 
human hepatoma HepG2? cells, with LCso values of >32 1g ml7! 
(Fig. 4c). In addition, molecular dynamics simulations revealed that 
analogue 2 penetrates membrane lipid bilayers with similar energy 
profiles to those of CD437 (Extended Data Fig. 8d, e, Supplementary 
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twice with similar results. d, Representative configurations of molecular 
dynamics simulations of retinoids at, from left to right, onset, membrane 
attachment, membrane penetration and equilibrium interacting with 
lipid bilayers (membrane composition: 108 phosphatidylglycerol lipids, 
72 Lys-PG lipids, and 10 DPG lipids; see Supplementary Methods for 
atomic rendering). Simulations were repeated five times with similar 
results. e, Free-energy profiles of retinoids penetrating the membrane as a 
function of the distance between the centre-of-mass (COM) of the retinoid 
and the lipid bilayer. The dot-dashed line marks the membrane surface, 
averaged from the COM location of phosphate groups in outer leaflet. 
Individual data points (n = 3 independent simulations) and mean + s.d. 
are shown. 


Video 16), further establishing that the extent of membrane penetra- 
tion inferred from molecular dynamics simulations correlates with 
antimicrobial activity. In summary, the structure-activity relationships 
verified that persistent attachment to lipid heads by the two polar branch 
groupsis critical for antimicrobial activity, and that antimicrobial activity 
and lack of cytotoxicity can be optimized by simple modifications to 
the polar branch groups. Notably, analogue 2 also exhibited favoura- 
ble pharmacokinetic profiles after intraperitoneal administration of a 
single dose of 20mg kg~', with a maximum plasma concentration of 
around 10g ml~! and an elimination half-life of 4.5 hours (Extended 
Data Fig. 8f). By contrast, adarotene is excreted rapidly”!**. Analogue 2 
showed no detectable hepatic or renal toxicity in mice at intraperitoneal 
doses of up to 80mg kg™! (the highest tested dose) every 12 hours for 
3 days (Extended Data Fig. 8g). 

Finally, we evaluated the efficacy of both analogue 2 and the combina- 
tion of analogue 2 and gentamicin in a mouse deep-seated thigh MRSA 
infection model, which mimics human deep-seated chronic infections”. 
Consistent with previous findings’, a combination of vancomycin and 
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Figure 3 | CD437 or CD1530 alone or in combination with gentamicin 
are effective against persisters. a, b, Viability of stationary-phase 

S. aureus MW2 (a) or S. aureus VRS1 (b) when treated with the indicated 
concentrations of each retinoid for 4hours. c, Viability upon treatment of 


gentamicin did not significantly reduce MRSA abundance (Fig. 4d) 
even though MW2 is sensitive to both antibiotics, which suggests that 
the bacterial cells in this infection model are persisters. As shown in 
Fig. 4d, 80mg kg~! of analogue 2 alone led to an approximately fourfold 


S. aureus MW2 persisters with the indicated concentrations of retinoids in 
combination with gentamicin (Gm). In a-c, the data points on the x axis 
are below the level of detection (2 x 10? CFU ml~!). Individual data points 
(n=3 biologically independent samples) and mean + s.d. are shown. 


decrease (P< 0.001) in MRSA abundance, and 40 or 80 mg kg“! of 
analogue 2 in combination with 30 mg kg~! gentamicin resulted in 
approximately 14-fold (P< 0.001) and approximately 23-fold decreases 
(P <0.001) in bacterial burden, respectively. Similarly, CD437 alone or 
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Figure 4 | Analogue 2 retains antimicrobial activity against MRSA 
persisters and has improved cytotoxicity compared with CD437. 

a, Chemical structure of analogue 2. b, Viability of S. aureus MW2 
persisters treated with analogue 2 alone or in combination with gentamicin 
(Gm). Data points on the x axis were below the level of detection 

(2 x 10° CFU ml’). c, Viability of normal human primary hepatocytes 
and human hepatoma (HepG2) cells treated with retinoids in serum-free 
medium for 24 hours, based on the absorbance readings at 450 nm taken 
4hours after adding the tetrazolium dye WST-1. The FDA-approved 
antineoplastic retinoid bexarotene was used as a control. b, c, Individual 
data points (n = 3 biologically independent samples) and mean + s.d. are 
shown. d, Efficacy of analogue 2 alone or in combination with gentamicin 
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in a deep-seated mouse thigh infection model. Each group of MW2- 
infected neutropenic mice (n = 10 biologically independent animals) 
was treated with the indicated doses of analogue 2 intraperitoneally (i-p.) 
alone or in combination with 30 mg kg! subcutaneous (s.c.) gentamicin 
(Gm), a combination of 25 mg kg“! vancomycin (Van, ip.) and 30 mg 
kg! gentamicin (s.c.) or control (5% Kolliphor + 5% ethanol, ip.) every 
12 hours for 3 days beginning 24 hours after infection. At 12 hours after 
the last treatment, mice were euthanized and their thighs were excised 
and homogenized. CFUs from each mouse thigh are plotted as individual 
points. The mean + s.d. is shown. Statistical differences between control 
and antibiotic treatment groups were analysed by one-way ANOVA and 
post hoc Tukey test (**P = 0.0002, ***P < 0.0001). 
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in combination with gentamicin also exhibited efficacy in the MRSA 
mouse deep-seated thigh infection model (Extended Data Fig. 10). 
These results suggest that a combination of analogue 2 and gentamicin 
or CD437 and gentamicin might be an effective strategy to enhance the 
efficacy and reduce the toxicity of aminoglycosides” in the treatment 
of chronic Gram-positive infections. 

Despite the potential advantages of membrane-active antimicro- 
bials such as the retinoids described here—including fast killing, low 
probability of developing resistance, and anti-persister activity— 
the major obstacle for developing retinoids as therapeutics is their 
potential cytotoxicity, which is a matter of considerable debate*>”®. 
Nevertheless, we have identified a specific chemotype of membrane- 
active synthetic retinoids that are relatively selective for bacterial 
membranes and exhibit a high level of activity towards MRSA per- 
sister cells; these findings are notable because the development of 
appropriate antibiotics for persisters is an important unmet need. 
Although a limited analysis of structure-activity relationships showed 
that modification of the retinoid branch groups can result in improved 
cytotoxicity profiles while retaining anti-persister activity, it is impor- 
tant to acknowledge that the long term-potential of further chem- 
ical optimization of retinoids to develop non-toxic antimicrobials 
is currently unknown. However, considering the fact that the bioac- 
tivity of retinoids can be improved by modifying both the backbone 
and branch groups, and that approximately 4,000 retinoid analogues 
have been synthesized so far?°, our results warrant further develop- 
ment of synthetic retinoids as potential therapeutics for hard-to-treat 
infectious diseases caused by antibiotic-resistant or persistent Gram- 
positive pathogens. 


Data Availability All data are available within the paper and its Supplementary 
Information. 
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Extended Data Figure 1 | CD437 and CD1530 show fast-killing kinetics 
and low probability of resistance development, and do not cause 
detectable cell lysis. a, Exponential-phase MRSA cells (strain MW2) were 
treated with 10x MIC CD437, CD1530, adarotene, vancomycin or 0.1% 
DMSO (negative control). CFU counts of cells were measured by serial 
dilution and plating on agar plates. The data points on the x axis are below 
the level of detection (2 x 10? CFU ml’). Individual data points (n =3 
biologically independent samples) and mean + s.d. are shown. 
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b, Development of S. aureus MW2 mutants resistant to CD437 (SPcp437), 
CD1530 (SPcpis30) or daptomycin (SPpap) was attempted by daily serial 
passage for 15 days. c, Exponential-phase S. aureus MW2 bacteria were 
treated with 10x MIC CD437, CD1530 or benzalkonium chloride (BAC) 
for 4h. The anti-infective detergent BAC was used as a positive control 
for cell lysis. ODgo9 was measured in a spectrophotometer every hour. 
Individual data points (n = 3 biologically independent samples) and 
mean +s.d. are shown. 
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Extended Data Figure 2 | All-atom molecular dynamics simulations 
showing the interactions between selected retinoids or retinoid 
metabolites and a DOPC:DOPG (7:3) lipid bilayer. a, Representative 
configurations of synthetic retinoids or retinoid metabolites at, left 

to right, the onset of simulation, membrane attachment, membrane 
penetration and equilibrium state (see Supplementary Methods for atomic 
rendering). Simulations were repeated five times with similar results. 

b, c, Free energy profiles of the four retinoids (b) or CD437-metabolites 
(c) penetrating the membrane as a function of the distance between the 
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COM of the retinoids or the retinoid metabolites and the lipid bilayer. 

The dot-dashed line marks the membrane surface, averaged from the 
COM location of phosphate groups in the outer leaflet. Individual data 
points (mn = 3 independent simulations) and mean + s.d. are shown. The 
membrane penetration of CD437, CD1530, adarotene, adapalene, the 
carboxylic-glucuronide metabolite and the phenolic hydroxyl-glucuronide 
metabolite are associated with transfer energies of —8.92 kgT, —7.14 kgT, 
—1.45 kgT, 18.76 kgT, —3.73 kgT, —2.02 kgT and energy barriers of 1.42 
kgT, 1.12 kT, 2.03 kgT, 26.13 kpT, 5.01 kgT, 7.40 kgT, respectively. 
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Extended Data Figure 3 | CD437 and CD1530 kill MRSA persisters 

by inducing membrane permeabilization. a, S. aureus MW2 persisters 
were treated with the indicated concentrations of the retinoids. Membrane 
permeability was measured spectrophotometrically by monitoring 

the uptake of SYTOX Green (Ax =485 nm, Aem = 525 nm) over time. 
Individual data points (n = 2 biologically independent samples) and means 
are shown; error bars are not shown for clarity. b—-d, Stationary-phase 
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S. aureus MW2 (b) or stationary-phase cells of 11 clinical S. aureus isolates 
were treated with 100 x MIC conventional antibiotics (c) or 10x MIC 
retinoids (d) for 4h. Viability was measured by serial dilution and plating 
on agar plates. The data points on the x axis are below the level of detection 
(2 x 10? CFU ml~!). b-d, Individual data points (n = 3 biologically 
independent samples) and mean + s.d. are shown. 
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Extended Data Figure 4 | CD437 or CD1530 alone or in combination with gentamicin. The number of viable cells in biofilms was measured 
with gentamicin eliminate persisters formed in MRSA biofilms. MRSA by CFU counting. The data points on the x axis are below the level of 
MW2 biofilms formed on 13 mm cellulose ester membranes were treated detection (2 x 10? CFU ml’). Individual data points (n = 3 biologically 
with the indicated concentrations of retinoids alone or in combination 


independent samples) and mean + s.d. are shown. 
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Extended Data Figure 5 | lonophores do not induce SYTOX 

Green membrane permeabilization or kill MRSA MW2 persisters. 

a, Synergism between nigericin and gentamicin was evaluated against 

S. aureus MW2 by the fractional inhibitory concentration index (FICI) 
microdilution checkerboard method. Optical densities at 600 nm were 
measured after 18h incubation at 37 °C. Experiments were independently 
repeated twice with similar results. Synergy, FICI < 0.5; no interaction, 

0.5 < FICI < 4; antagonism, FICI > 4. b, Exponential-phase MW2 cells 
were treated with the indicated concentrations of valinomycin, nigericin or 
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monensin. Membrane permeability was measured spectrophotometrically 
by monitoring the uptake of SYTOX Green (A.x = 485 nm, Aem = 525 nm) 
over time. Individual data points (n = 2 biologically independent 
samples) are shown; error bars are not shown for clarity. c-e, Stationary- 
phase S. aureus MW2 was treated with the indicated concentrations of 
ionophores, alone or combined with 10x MIC gentamicin (Gm), or 
0.1% DMSO (control) for 4h. Viability was measured by serial dilution 
and plating on agar plates. Individual data points (n = 3 biologically 
independent samples) and mean + s.d. are shown. 
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Extended Data Figure 6 | Evaluation of cytotoxic potentials of retinoids 
in various cell lines. a, Measurement of haemolytic activity. 2% human 
erythrocytes were treated with twofold serially diluted concentrations 

of the retinoids for 1h at 37°C. A sample treated with 1% Triton X-100 
was used as the control for 100% haemolysis. b, Normal rat, human 
primary hepatocytes, human hepatoma (HepG2) cells, normal human 
primary renal proximal tubule epithelial cells (RPTEC) or adult normal 
human epidermal keratinocytes (NHEK) were treated with a range of 
concentrations of the synthetic retinoids in chemically defined, serum-free 


medium for 24h. The FDA-approved antineoplastic retinoid bexarotene 
was used as a control. Cell viability was calculated on the basis of 
absorbance readings at 450 nm at 4h after adding WST-1. a, b, Individual 
data points (n= 3 biologically independent samples) and mean + s.d. are 
shown. ¢, Three synthetic retinoids and the positive control quinidine 
were tested for inhibition of the hERG potassium channel. Individual 
data points (n = 4 biologically independent samples) and mean +s.d. are 
shown. Data are fitted to a standard inhibition curve. 
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Extended Data Figure 7 | Structure-activity relationships. a, The monitoring the uptake of SYTOX Green (A-x = 485 nm, Aem = 525 nm) 
chemical structures of newly synthesized CD437 analogues. b, MICs over time. Individual data points (n = 2 biologically independent samples) 
and membrane permeability were measured for S. aureus strain MW2. and means are shown; error bars are not shown for clarity. 


Membrane permeability was evaluated spectrophotometrically by 
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Extended Data Figure 8 | Determination of the biological properties 

of analogues 2 and 9. a, b, Human erythrocytes were treated for 1h (a) 
and rat primary hepatocytes were treated for 24h (b) with analogues 2 and 
9. c, MRSA MW2 persisters were treated with analogue 9. The data points 
on the x axis are below the level of detection (2 x 107 CFU ml’). 

a-c, Individual data points (n = 3 biologically independent samples) and 
mean + s.d. are shown. d, Representative configurations of molecular 
dynamics simulations of analogue 2 interacting with lipid bilayers 

(108 phosphatidylglycerol lipids, 72 Lys-PG lipids and 10 DPG lipids; 

see Supplementary Methods for atomic rendering). Simulations were 
repeated five times with similar results. e, Free energy profiles of analogue 
2, CD437 and adarotene penetrating the membrane as a function of 

the distance between the COM of the retinoids and the lipid bilayer. 

The dot-dashed line marks the membrane surface, averaged from the 
COM location of phosphate groups in outer leaflet. Individual data 

points (n =3 independent simulations) and mean + s.d. are shown. 

f, The plasma concentrations of analogue 2 after a single injection of 
analogue 2 (20mg kg“', i.p., 3 mice per time point) were measured 


16 20 24 


—o— CD437 
—4— Analog 2 
—v— Adarotene 


Potential of mean force (k,T) 


0 1 2 3 4 


ALT BUN 


using LC-MS/MS. Pharmacokinetic analysis was conducted using 
Phoenix WinNonlin software version 6.3. Individual data points (n =3 
biologically independent animals) and mean + s.d. are shown. The 
determined pharmacokinetic parameters are Trax (the time taken to 
reach the maximum concentration) 0.5h, Cmax (maximum concentration 
observed) 16.14 .g ml~!, AUCjast (area under the curve to last time point) 
16.38h-ug ml~!, AUCin¢ (area under the curve to infinite) 16.54h-g ml“!, 
ty (half-life) 4.49 h, clearance 20.16 ml min~! kg~'. g, Six mice per group 
(n= 6 biologically independent animals) were treated with control (5% 
Kolliphor + 5% ethanol, i.p.), vancomycin (25 mg kg~', ip.) or analogue 2 
(10-80 mg kg” !, i.p.) every 12h for 3 days. At 12h after the last treatment, 
alanine aminotransferase (ALT) and blood urea nitrogen (BUN) were 
analysed. The concentrations of ALT (in international units per litre, 

IU 1~!) and BUN (mg dl’) in each mouse serum sample analysed are 
plotted as individual points and the mean + s.d. is shown. Control and 
antibiotic treatments were analysed by one-way ANOVA and post hoc 
Tukey test, which demonstrated a lack of significant differences (P > 0.7 
for all ALT and BUN samples). 
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Extended Data Figure 9 | The charges and the number of branch groups 
affects membrane activity of CD437-like retinoids. a, Comparison of 
partial atomic charges between CD437 and analogue 3. b, Representative 
configurations of molecular dynamics simulations of analogues 3, 11, and 
14 interacting with lipid bilayers (DOPC:DOPG, 7:3). The amide group 
in analogue 3 is repelled away from the membrane despite the attachment 
of the hydroxyl group. Atomic rendering is described in Supplementary 
Methods. Simulations were repeated five times with similar results. 
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c, d, Free energy profiles of analogue 3 penetrating DOPC:DOPG (7:3) 
lipid bilayers (c) and CD437 penetrating differently charged lipid bilayers 
(d). e, Analogues 11 and 14 penetrating DOPC:DOPG (7:3) lipid bilayers 
as a function of the distance between the COM of the retinoids and the 
lipid bilayer. The dot-dashed line marks the membrane surface, averaged 
from the COM location of phosphate groups in the outer leaflet. 

c-e, Individual data points (n = 3 independent simulations) and 

mean +s.d. are shown. 
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Extended Data Figure 10 | In vivo efficacy of CD437 alone or in 
combination with gentamicin in a deep-seated mouse thigh infection 
model. We chose a dose of 20 mg kg~! CD437 to test its in vivo efficacy 

in the MRSA mouse deep-seated thigh infection model, because a dose 
of 20 mg kg”! has shown in vivo efficacy in mouse xenograft cancer 
models*”~’. Ten MRSA MW?2-infected mice per group (n= 10 biologically 
independent animals, see Supplementary Methods) were treated with 
control (5% Kolliphor + 5% ethanol, i-p.), vancomycin (25 mg kg, ip.) 
gentamicin (30 mg kg“', s.c.), CD437 (20mg kg™', ip.), or a combination 
of CD437 (20 mg kg™', ip.) and gentamicin (30 mg kg~', s.c.) every 12h 


27. Schadendorf, D. et a/. Treatment of melanoma cells with the synthetic retinoid 
CD437 induces apoptosis via activation of AP-1 in vitro, and causes growth 
inhibition in xenografts in vivo. J. Cell Biol. 135, 1889-1898 (1996). 

28. Langdon, S. P. et al. Growth-inhibitory effects of the synthetic retinoid CD437 
against ovarian carcinoma models in vitro and in vivo. Cancer Chemother. 
Pharmacol. 42, 429-432 (1998). 


for 3 days beginning 24h after infection. At 12h after the last treatment, 
mice were euthanized and their thighs were excised and homogenized, 
and blood was collected and analysed for ALT and BUN. a, CFUs from 
each mouse thigh are plotted as individual points and the mean + s.d. 

for each experimental group is shown. b, c, Concentration of ALT for 
each mouse serum sample (b) and absorbance of BUN at 340 nm (c) are 
plotted as individual points. The mean + s.d. for each experimental group 
is shown. Statistical differences between control and antibiotic treatment 
groups were analysed by one-way ANOVA and post hoc Tukey test 
(P< 0.0001). 


29. Ponzanelli, |. et al. Isolation and characterization of an acute promyelocytic 
leukemia cell line selectively resistant to the novel antileukemic and 
apoptogenic retinoid 6-[3-adamantyl-4-hydroxyphenyl]-2-naphthalene 
carboxylic acid. Blood 95, 2672-2682 (2000). 
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Whole-organism clone tracing using single-cell 


sequencing 


Anna Alemany", Maria Florescu!, Chloé S. Baron!*, Josi Peterson- Maduro! & Alexander van Oudenaarden! 


Embryonic development is a crucial period in the life of a 
multicellular organism, during which limited sets of embryonic 
progenitors produce all cells in the adult body. Determining 
which fate these progenitors acquire in adult tissues requires the 
simultaneous measurement of clonal history and cell identity at 
single-cell resolution, which has been a major challenge. Clonal 
history has traditionally been investigated by microscopically 
tracking cells during development!”, monitoring the heritable 
expression of genetically encoded fluorescent proteins* and, more 
recently, using next-generation sequencing technologies that 
exploit somatic mutations‘, microsatellite instability*, transposon 
tagging®, viral barcoding’, CRISPR-Cas9 genome editing® 3 and 
Cre-loxP recombination". Single-cell transcriptomics!* provides 
a powerful platform for unbiased cell-type classification. Here we 
present ScarTrace, a single-cell sequencing strategy that enables 
the simultaneous quantification of clonal history and cell type 
for thousands of cells obtained from different organs of the adult 
zebrafish. Using ScarTrace, we show that a small set of multipotent 
embryonic progenitors generate all haematopoietic cells in the 
kidney marrow, and that many progenitors produce specific cell 
types in the eyes and brain. In addition, we study when embryonic 
progenitors commit to the left or right eye. ScarTrace reveals that 
epidermal and mesenchymal cells in the caudal fin arise from the 
same progenitors, and that osteoblast-restricted precursors can 
produce mesenchymal cells during regeneration. Furthermore, 
we identify resident immune cells in the fin with a distinct clonal 
origin from other blood cell types. We envision that similar 
approaches will have major applications in other experimental 
systems, in which the matching of embryonic clonal origin to adult 
cell type will ultimately allow reconstruction of how the adult body 
is built from a single cell. 

The goal of our experiment is twofold: first, to link cells in the 
embryo to their corresponding clones in adult tissue (Fig. 1a); sec- 
ond, to quantify cell-type composition of these clones to determine 
the multipotency of embryonic progenitors. To reach the first goal, we 
need to uniquely label the cells in an embryo with permanent and herit- 
able labels. For this, we use CRISPR-Cas9 technology, which induces a 
double-stranded break at the targeted genomic site that is repaired 
as insertions or deletions of different lengths at different positions 
(scars)!®!7, To allow for multiple scarring in the same cell, we use a 
zebrafish line with eight in-tandem copies of a histone-green fluor- 
escent protein (GFP) transgene!” (Methods). Scarring starts after 
injecting the yolk or cell of the zygote with Cas9 RNA or protein, and 
a single-guide RNA (sgRNA) that targets GFP (Fig. 1b). 

We quantified the scarring rate by measuring the fraction of 
unscarred GFP in zebrafish embryos at different times after Cas9 
delivery (Fig. 1c), which is five times faster in Cas9 protein than in 
RNA injections. Cas9 activity ceases at around 3h for protein and at 
10h for RNA injections, when zebrafish embryos have about 1,000 and 
8,000 cells, respectively”. We detect more than 1,000 distinct scars, the 


abundances and probabilities of which span several orders of magni- 
tude!’ (Supplementary Information sections 1 and 2). 

To detect scars and transcriptome from single cells, we developed 
ScarTrace, which integrates a nested PCR step after transcriptome 
conversion to cDNA into the sorting and robot-assisted transcriptome 
sequencing (SORT-seq) protocol! (Fig. 1d). Because the histone- 
GFP transgene is transcribed, scars can be detected from mRNA and 
genomic DNA (gDNA). Detection from gDNA is preferred because 
GFP expression might be tissue specific, vulnerable to silencing and 
scars might affect the half-life of the mRNA. We assessed the efficiency 
of scar detection from mRNA and gDNA by comparing scar patterns 
of single cells from the caudal fin obtained using ScarTrace with and 
without reverse transcription (Fig. 1d, step 1). We detected 3.3 + 0.3 
(mean +s.e.m.) scars per clone on average, and approximately 25% of 
the cells remained unscarred and therefore do not contain clonal infor- 
mation (Extended Data Fig. 1a). Clone sizes from gDNA and gDNA- 
mRNA detection are very similar (Extended Data Fig. 1a—d), indicating 
that ScarTrace reliably detects scars from gDNA in single cells. 

We next used ScarTrace to explore the clonal composition of hae- 
matopoietic cells isolated from the whole kidney marrow (WKM) of two 
protein-injected (P1 and P2) and two RNA-injected (R1 and R2) zebrafish. 
We found one and two major clones in P1 and P2, and eight and six in 
R1 and R2, respectively (Fig. 2a, b, Extended Data Fig. 2a, b, Extended 
Data Table 1). This is a direct result of the time window of Cas9 activity 
(Fig. 1c). The number of observed clones agrees with previous findings 
using GESTALT®, in which a similar Cas9-mediated approach is used 
to label embryonic clones in zebrafish, and with the number of clones 
(between 10.4 and 15.4) found for haematopoietic stem and progenitor 
cells at 10-14hours post fertilization (hpf) using Zebrabow””. 

The average number of scars per clone equals 3.3 + 0.3 for P1, 
1.02 £0.01 for P2, 3.5 -£0.3 for R1 and 3.0 £0.3 for R2, with a minimum 
of 1 scar and a maximum of 5 scars per clone, revealing that both Cas9 
protein and RNA efficiently cause scarring. We determined the copy 
number for each scar in a clone by modelling the amplification and 
sequencing noise of ScarTrace as a branching process (Supplementary 
Information section 3). Typically, the resulting number of scars per 
clone is smaller than eight, as a consequence of two or more simultane- 
ously Cas9-induced cuts in the same multi-copy tandem histone-GFP 
gene’®. We computed the P value of a combination of scars to occur in 
a cell (Fig. 2a, b). Values obtained are commonly below 10~, empha- 
sizing that although identical scars might be independently introduced 
in different clones (for example, the yellow scar is present in one clone 
from fish R1 and four clones from fish R2), the chance of introducing 
the same combination of scars in independent clones is very small. 
Consistently, we do not find overlapping clones between different 
zebrafish. Using cell-to-cell variation in scar composition, we estimate 
a 90% scar detection efficiency (including unscarred GFP; Extended 
Data Fig. le, f). In addition, by assuming maximum parsimony for 
sequential scarring events, we build lineage trees for clones (Fig. 2c, d, 
Extended Data Fig. 2c, d, Supplementary Information section 4). 
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Using RaceID”° (Methods), we identify eight haematopoietic cell 
types in fish R1 and R2 (Fig. 2e). Gene expression profiles in the 
different cell types found for both fish are identical with the exception 
of erythrocytes, which show slight differences in the expression of 
characteristic markers (Extended Data Fig. 2e-h). After combining 
cell type and clonal information for single cells, we observe all clones 
in all cell types with similar proportions (Fig. 2f, g, Extended Data Fig. 
2i, j), indicating that all clones contribute to the production of all blood 
cells. This is consistent with haematopoietic stem and progenitor cells 
specification (around 28 hpf), when scarring is already completed”". 

Next, we used ScarTrace in the adult brain and eyes of two RNA- 
injected fish (R2 and R3), in which we identified different neuronal, 
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glia and immune cells (Fig. 3a, Extended Data Fig. 3). To determine 
clonal enrichment or depletion in certain cell types quantitatively, we 
used Fisher's exact test (Fig. 3b, Extended Data Fig. 4a). Here, several 
clones only generate neurons or retinal interneurons (Extended Data 
Fig. 4b, c). We observed that microglia share clones with the WKM, 
confirming that they originate from the WKM”. 

Upon the exclusion of WKM clones, we found that clones are not 
only cell-type specific, but also brain-region and eye specific (Extended 
Data Figs 4d-g, 5a—d). Although R2 and R3 left and right midbrains 
share a small fraction of clones, left and right eyes share none (Fig. 3c, 
Extended Data Fig. 4h). However, for fish P1, both midbrains share 
almost all clones whereas eyes share only one. To explore when this 


Figure 2 | Few clones produce haematopoietic 
cells. a, b, Scar percentage per cell for fish P1 
(a) and R1 (b) (for key, see Extended Data 
Fig. 2a). The bar above each panel indicates 
clones and corresponding P values. uGFP, 
unscarred GFP. ¢, d, Lineage trees for clones 
in Pl (c) and R1 (d). The root is an unscarred 
clone. Clones with corresponding cell fraction 
and scar copy number (Supplementary 
Information section 3) are at the tips. The 
statistical confidence of each branch is 
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e, t-distributed stochastic neighbour embedding 
(t-SNE) map of cells from fish R1 and R2 
obtained with transcriptome data. Colours 
indicate the cell type (Extended Data Fig. 2). 
HSPCs, haematopoietic stem and progenitor 
cells. f, t-SNE map of cells in fish R1. Colours 
indicate the clone. g, Clonal cell fraction per cell 
type for fish R1. 
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Figure 4 | Clonality during caudal fin regeneration. a, Primary 

(first week), secondary (fully regenerated after 3 weeks) and tertiary 
(fully regenerated after 6 weeks) fins are amputated. Tertiary fins are 
split dorsally and ventrally. b, t-SNE map of cells in fish R4, R5 and R6. 
Colours indicate cell type (Extended Data Fig. 6). c, Heat map of clonal 
cell fraction per cell type and fin in fish R4, and two-sided Fisher's exact 
test for clones that are enriched and depleted with P < 0.05. The bars at 
the top show clones in the WKM, number of cells and P value per clone. 
d, t-SNE map of cells from fish R4, with cells from the primary (top) or 
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regenerated (bottom) fin detected in osteoblast clones or remaining cell 
types. Percentages are mesenchymal cell fractions that share clones with 
osteoblasts. e, Magnified view of t-SNE map indicating immune cells with 
fin-specific clones, immune cells that share clones with the WKM 

and cells with no clone information (grey). Subpopulations of lymphoid 
cells and myeloid cells (numbered 1-4), and the percentage of RICs are 
indicated. f, Differential gene expression between the four subpopulations 
of myeloid cells (Supplementary Table 6). g, Cell type fraction for 
fin-specific clones detected in RICs in the primary fin for R4. 
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segregation is established, we injected one cell at the two-cell stage 
with Cas9-eScarlet fusion protein and sgRNA. We found Cas9-eScarlet 
protein present in only half of the embryo at dome stage (Extended 
Data Fig. 4i-l), approximately 3 h after Cas9 protein stops scarring. 
Therefore, scars only occur in one side of the embryo. However, 
ScarTrace on the left and right eyes of a 3-week-old-injected embryo 
(S1) reveals scars in both eyes. Upon removal of the clones found in 
immune cells and erythrocytes, the rest of the clones are specific to each 
eye (Fig. 3c). This indicates that both eyes get cellular contributions 
from both sides of the dome-stage embryo. To determine further when 
lateral commitment arises in eye progenitors, we built lineage trees for 
clones detected in the left and right eyes or midbrain for fish P1 and R2 
(Extended Data Fig. 5e-h). In P1, no significant co-evolution is found 
among clones from the right (left) eye. By contrast, in R2 we observe a 
significant depletion of right eye clones evolving with left eye clones. 
This suggests that progenitors commit to the left or right eye shortly 
before the end of scarring with Cas9 protein. No significant co-evolution 
enhancement or depletion is found for clones detected in the left and 
right midbrain, indicating that cell mixing is important at 10 hpf. This 
is consistent with the processes of neurulation and neurogenesis”. 

Next, we focused on zebrafish caudal fin ontogeny and regen- 
eration. We performed ScarTrace on the primary, secondary and 
tertiary fins of fish R4, R5 and R6 (Fig. 4a). We identified four major 
cell types (osteoblasts, mesenchymal, epidermal and immune cells) and 
observed cell-type-restricted clones in all fish (Fig. 4b, c, Extended Data 
Figs 6, 7a-e). We found that mesenchymal and epidermal cells share 
clones, revealing a common developmental origin that is maintained dur- 
ing regeneration. Together with previous imaging-based studies”, this 
suggests that epidermal ancestors undergo epithelial-to-mesenchymal 
transition during gastrulation to generate mesenchymal cells in the 
caudal fin. Osteoblasts did not share clones with any other cell type 
in the primary fin and showed dorsal-ventral segregation, confirm- 
ing their early lineage commitment during development!*”>?8. We 
found lineage restriction of the different cell types as the main mecha- 
nism of fin regeneration, consistent with previous results?>*®, However, 
after regeneration, we observed osteoblast-committed clones that 
generate a fraction (approximately 21% in R4, 44% in R6) of mesenchymal 
cells (Fig. 4d, Extended Data Fig. 7f). This suggests a certain degree of plasticity 
after injury, in which progenitors that produce osteoblasts during development 
can also give rise to mesenchymal cells during fin regeneration”. 

Finally, we investigated the clonal overlap of single cells from the WKM of 
fish R4, R5 and R6 with immune cells found in the fin. Clones detected in the 
WkKM are enriched in the fin immune cells and depleted in the remaining 
cell types (Fig. 4c, e, Extended Data Fig. 7g). We found sub-populations of 
lymphoid and myeloid cells in all fins with different proportions of fin-spe- 
cific clones, which we identify as resident immune cells (RICs). Differential 
gene expression analysis in myeloid cells revealed that subpopulation 4 
expressed macrophage markers together with the epithelial marker epcam 
(Fig. 4f), which has been reported in resident macrophages in mice*”. All 
RICs in the primary fin share clonality with epidermal and mesenchymal cells 
(Fig. 4g, Extended Data Fig. 7b, c). Therefore, our data indicate that RICs 
have a distinct origin from haematopoietic stem cells (Extended Data Fig. 7h), 
and arise either from epidermal and mesenchymal transdifferentiation, or 
from ectodermal ancestors similarly to mesenchymal cells. 

We developed ScarTrace as a new method to quantify clonal origin 
and cell type simultaneously at single-cell resolution. This enabled us to 
investigate the embryonic origin of clones found in different organs of 
the adult zebrafish and their cell-type commitment during development 
and regeneration. CRISPR-Cas9 genome editing technology for lineage 
tracing purposes at the single cell level has recently also been used in 
zebrafish to investigate lineages and cell types in the vertebrate brain, 
and to unravel developmental lineages*'*”. We anticipate many applica- 
tions of ScarTrace in developmental and stem-cell biology, and similar 
approaches to study clonal selection in cancer models. Because ScarTrace 
provides a glimpse of the cellular past, it will be interesting to explore how 
this history is predictive of the current epigenetic and expression state. 
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Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Zebrafish Cas9 and sgRNA injections. Heterozygous zygotes of the transgenic 
zebrafish line Tg(h2afva:GFP)**°°:(h2afva:GFP)K* (ref. 17) were injected at 
the cell with 1 nl Cas9 protein (NEB; final concentration 1,590 ng ul!) or at the 
yolk with 1 nl Cas9 RNA (300 ng!) in combination with an sgRNA that tar- 
gets GFP (25 ng yl, sequence: GGTGTTCTGCTGGTAGTGGT) (Fig. 1b). Cas9 
RNA was in vitro transcribed from a linearized pCS2-nCas9n vector (Addgene 
plasmid 47929)!° using the mMESSAGE mMACHINE SP6 Transcription Kit 
(Thermo Scientific). The sgRNA was in vitro transcribed from a template using 
the MEGAscript. T7 Transcription Kit (Thermo Scientific). The sgRNA template 
was synthesized with T4 DNA polymerase (New England Biolabs) by partially 
annealing two single-stranded DNA oligonucleotides containing the T7 promotor 
and the GFP-binding sequence, and the tracrRNA sequence, respectively**. Male 
and female zebrafish were used, no randomization was done, no blinding was 
done and no animals were excluded from the analysis. No statistical methods were 
used to predetermine sample size. The age of the fish used in isolated organ spans 
3-18 months. For sample sizes, see Extended Data Table 2. All animal experiments 
were performed in accordance with institutional and governmental regulations, 
and were approved by the Dier Experimenten Commissie of the Royal Netherlands 
Academy of Arts and Sciences and performed according to the guidelines. 
Transgene copy number. To determine the number of integrations of the trans- 
gene, we performed whole-genome sequencing (NEBNext Ultra library prepa- 
ration kit for Illumina (E7370S) and the NEB Multiplex Oligos for Illumina 
(E7500L)) on an homozygous Tg(h2afva:GFP)**®:(h2afva:GFP)*® fish. Paired- 
end data were trimmed (TrimGalore-0.4.3) and mapped (bwa-0.7.10 mem) to the 
zebrafish reference genome (danRer10 from UCSC Genome Browser), and PCR 
and optical duplicates were removed (Picard-2.0.1) (Extended Data Fig. 8a, b). 
The copy number was extracted using FREEC-11.0°” with default parameters. With 
a 1-kb window size, we find 19 copies of the transgene fragment, whereas with a 
500-bp window size, we find 18 (Extended Data Fig. 8b). After correcting for reads 
due to endogenous copies, we estimate the number of copies of the transgene in 
a heterozygous fish to be 8 + 1. This number agrees with single-cell data, because 
although we detected a maximum of 7 scars per clone (Extended Data Fig. 8c, d), 
we see that sometimes 6 of the scars in those clones represent approximately 12.5% 
of the scar content per cell, and one represents approximately 25% (Extended Data 
Fig. 8e). This again suggests that the number of integrations of the histone-GFP 
transgene is 1/0.125=8. 

ScarTrace protocol. Live single cells (based on DAPI exclusion and scatter 
properties) were sorted into 384-well plates (Sigma-Aldrich) containing 51] of 
mineral oil (Sigma-Aldrich), 50 nl of uniquely barcoded reverse transcription prim- 
ers (Supplementary Table 1), dNTPs (Promega), Spike-in controls (Thermo Fisher) 
and RNase inhibitor (SUPERaseln, Thermo Fisher). Plates were immediately spun 
down and stored at —80°C. Cells were lysed at 65°C for 5 min. Reverse transcrip- 
tion and second-strand synthesis mixes were dispensed into each well using the 
Nanodrop II and reactions were performed at 42 and 16°C degrees, respectively 
(Fig. 1d, step 1). Genomic DNA was access by proteinase K treatment followed by 
a nested PCR strategy to amplify the scarred GFP region (Fig. 1d, step 2). In the 
second PCR, unique scar barcodes were introduced in each well (Supplementary 
Table 2). All cells were pooled and the aqueous phase was separated from the 
oil phase (Fig. 1d, step 3). The collected material was split for scar library and 
transcriptome library preparation (Fig. 1d, step 4). For transcriptome library 
preparation, the SORT-seq protocol!* was used (Fig. 1d, step 5a). For scar library 
preparation, a PCR introducing only Illumina TruSeq adapters was perfomed 
(Fig. 1d, step 5b). All libraries were sequenced paired-end at 75 bp read-length on 
the Illumina NextSeq platform. A detailed description of the protocol is available 
in Protocol Exchange™. 

WEM isolation. The WKM was isolated as previously described™. A ventral 
midline incision was made to open the adult zebrafish body cavity. All internal 
organs were carefully removed to access the kidney. The WKM was collected in 
PBS supplemented with FCS. The tissue was aspirated through a 1 ml pipet tip sev- 
eral times to mechanically dissociate haematopoietic cells. After two consecutive 
filtering steps (using 70-j1m and 40-|.m cell strainers (VWR), cells were centrifuged 
and washed. The pellet of haematopoietic cells was resuspended in PBS and FCS 
supplemented with DAPI (Thermo Fisher) to assess cell viability 

Brain parts and eye isolation. Brain and eyes were isolated from the zebrafish 
head and dissected in PBS. Optic nerves were removed. The forebrain (olfactory 
bulb and telencephalon) was isolated from the midbrain, followed by dissection of 
the hindbrain (rhombencephalon). The midbrain (mesencephalon) was dissected 
into left and right midbrain. The eyes lens was carefully removed. Brain parts 
and eyes were dissociated into single cells using a papain-based solution (Thermo 
Fisher, 88285) and washing solutions as previously described*®. The washed cell 
pellet was resuspended in DMEM/F12 medium (Thermo Fisher, 11320033) and 
supplemented with DAPI (Thermo Fisher) to assess cell viability for FACS. 
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Fin amputation. Caudal fin amputations were performed as previously 
described**, after which fish were returned to 28°C aquarium water. Once 
isolated, this tissue was immediately dissociated by moderately shaking at 30°C for 
1h, with gentle trituration performed every 10 min with a p200 pipet, in a solution 
of 2mg ml ! collagenase A (Sigma-Aldrich) and 0.3 mg ml”! protease (type XIV, 
Sigma-Aldrich) in Hanks solution. After 1h, the solution was incubated for 5 min 
in 0.05% trypsin in PBS. The solution was strained using 70-j1m and 40-j1m cell 
strainers (Corning) and cells were washed in 2% FBS in Hanks solution. Before flow 
cytometry, cells were centrifuged and resuspended in PBS and FBS supplemented 
with DAPI (Thermo Fisher) to assess cell viability. 

Transcriptome analysis. In transcriptome libraries, the first read contains cell 
barcode (Supplementary Table 1) and unique molecular identifier (UMI) infor- 
mation, and the second read contains biological information. Second reads with 
a valid cell barcode extracted from corresponding first reads are mapped using 
bwa mem-0.7.10 with default parameters to the reference zebrafish transcriptome 
(Danio rerio assembly Zv9, ensemble 74, extended with ERCC92). For each cell, 
the number of transcripts per gene was obtained as previously described*’. We 
refer to transcripts as unique molecules based on UMI correction. We ran RaceID3 
with different parameters for each organ under study (Supplementary Data 1) for 
cell filtering, normalization, gene filtering, cell clustering and differential gene 
expression analysis (in which P values are calculated using negative binomial dis- 
tribution and corrected for multiple testing by the Benjamini-Hochberg method). 
The choice of filtering parameters was made to include the maximum number of 
cells in our analysis without losing cell type information. Supplementary Tables 3-6 
provide results for the differentially expressed genes for each cell type compared 
with all other cells in the organ: WKM**~4! (90 dendritic cells, 76 eosinophils, 
641 erytrhocytes, 516 haematopoietic stem and progenitor cells, 446 lymphocytes, 
409 monocytes, 927 neutrophils and 76 thrombocytes), brain and eyes*?->! (250 
bipolar and horizontal cells, 45 COPCs, 9 cones, 290 erythrocytes, 254 immune 
cells, 88 glia-like cells, 89 MFOLs, 66 microglia, 1,427 neurons, 10 OPCs, 31 RCL, 
53 radial glia and 202 rods), caudal fin°?-** (144 epidermal cells, 2,834 fibroblasts, 
1,784 immune cells, and 2,951 osteoblasts), and resident myeloid cell types in the 
fin (118 cells in subpopulation 1, 45 in subpopulation 2, 27 in subpopulation 3 and 
133 in subpopulation 4). 

Scar analysis. In scar libraries, the first read contains the cell barcode 
(Supplementary Table 2) and the forward primer used in the nested PCR and 
second read contains the sequence for the scar and the reversed primer. Scripts 
to extract scars and detect clones are provided as Supplementary Data 2, together 
with a reference manual (Supplementary Data 3). Bug fixes and updates of the 
scripts can be downloaded from https://github.com/anna-alemany/scScarTrace. 
Cells sharing an identical scar pattern are assumed to come from the same clone, 
independently of scar percentage. Cells with a detected scar pattern that can be 
assigned to another single clone by assuming that some scar was not sampled were 
pooled with that clone. Cells that according to their scar pattern can be ambigu- 
ously assigned to two or more other clones were removed from subsequent analysis. 
Clones with less than three cells were also removed. 

Code availability. Transcriptome analysis was performed using RaceID3 available 
at https://github.com/dgrun/RaceID3_StemID2, with parameters summarized in 
Supplementary Data 1. Scripts for scar extraction and clone detection are provided 
in Supplementary Data 2, together with a reference manual (Supplementary Data 3). 
Bug fixes and updates of the scripts can be downloaded from https://github.com/ 
anna-alemany/scScarTrace. 

Data availability. The accession numbers for the RNA sequencing datasets 
reported in this paper have been deposited with the Gene Expression Omnibus 
(GEO) under accession GSE102990. 
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Extended Data Figure 1 | gDNA versus gDNA-mRNA detection of 
scars. a, Scar percentage per cell (top bar indicates clones), and pie chart of 
fraction of cells per clone (colours matching histograms’ top bar) detected 
via gDNA (ScarTrace without step 1) and gDNA-mRNA detection (full 
protocol). b, Number of detected cells per clone in gDNA versus gDNA- 
mRNA detection and Pearson's correlation coefficient computed using 

the 20 different clones identified. Dot sizes are proportional to the total 
number of cells found taking together the two detection strategies. 

c, Fraction of cells detected per clone in gDNA (green) and gDNA-mRNA 
(purple) detection, in which clone ‘0’ represents unscarred cells. d, Top, 
one-sided Fisher’s exact test on a contingency table made of the number 
of cells detected for the given scar clone x for each detection strategy 

(a and b, respectively), and the number of cells taking together all other 
clones (c and d, respectively) found in gDNA detection (n= 147 cells in 
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total) and gDNA-mRNA detection (n = 128 cells in total). Bottom, heat 
map shows the one-sided P value of each scar clone to be enriched in 
the gDNA or mRNA/gDNA detection protocol. No enrichment is found 
with P< 0.05, therefore results found for the two detection protocols 

are compatible. e, Normalized histograms and corresponding fit noise 
model function (grey line; Supplementary Information section 4) for 
the scar percentage detected for 1 clone found in fish P1. Scar detection 
efficiency is defined as the area above 5% scar content (vertical red line). 
Efficiency of detection of unscarred molecules is assumed to be the same 
as for scarred molecules. f, Normalized histogram of the scar detection 
efficiencies found after pooling all clones from all organs for all fish 

(in total, n = 371 detected clones; Extended Data Table 1). The vertical 
black line and the grey area indicate the mean scar detection efficiencies 
and s.e.m., respectively. 
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Extended Data Figure 2 | Transcriptome analysis of the zebrafish WKM. 
a, b, Scar percentage per cell for fish P2 (a) and R2 (b). The bar above 
each panel indicates clones and corresponding P values. c, d, Lineage trees 
for clones detected in P2 (c) and R2 (d) obtained as described in Fig. 2. 

e, t-SNE map of cells from fish R1 and R2. Colours and numbers indicate 
RaceID clusters. f, Heat map of the Pearson correlation between cells 
sorted according to RaceID clusters. Cluster numbers are indicated on the 
x and y axes. g, t-SNE map for the WKM of R1 and R2 coloured according 


to fish of origin (R1 in pink and R2 in green). All cell types intermingle 
well, except erythrocytes. Even though erythrocytes appear separately on 
the t-SNE space, they belong to the same RaceID cluster. h, t-SNE maps for 
R1 and R2, coloured according to the number of unique transcripts 

per single cell for each marker**“". A full list of marker genes for each cell 
type is available in Supplementary Table 3. i, ‘SNE map for fish R2 with 
cells coloured according to clone. j, Clonal cell fraction per cell type for 
fish R2. 
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Extended Data Figure 3 | Cell types and batch effects in the brain 

and eyes for fish R2 and R3. a, t-SNE map obtained with RaceID of 
pooled cells with a minimum of 100 total transcripts from fish R2 and R3 
(isolated from the right and left eyes, right and left midbrain, forebrain 
and hindbrain). Different symbols indicate cells with different minimal 
total transcript counts. Cells are coloured according to the assigned cell 
type using the lowest cut off (that is, taking into account cells with at least 
100 transcripts). We do not lose any cell type cluster when applying higher 
transcript cut offs, nor do we generate new clusters of low transcript cells 


when applying lower cut offs. The fraction of cells that would be termed a 
different cell type with a higher cut off is very low (<1%). Low transcript 
cells cluster mainly around the clusters formed by high transcript cells. 

b, c, SNE maps as in a, in which cells are coloured according to organ 

(b) and fish (c) of origin. d, t-SNE map as in a, but showing only cells from 
fish R3, with corresponding cell types indicated. e, t-SNE maps for fish 

R2 and R3 coloured according to the number of unique transcripts per 
single cell*?-°!. A full list of marker genes for each cell type is available in 
Supplementary Table 4. 
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Extended Data Figure 4 | See next page for caption. 
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Extended Data Figure 4 | ScarTrace in the zebrafish brain and eyes. 

a, Heat map of the fraction of cells per clones for cell types in fish R3 
(COP, OPC and MFOL clones merged as oligodendrocytes), and two-sided 
Fisher’s exact test for enriched (magenta upwards triangle) and depleted 
(blue downwards triangle) clones per cell type with P < 0.05. The bars at 
the top depict organ and corresponding total number of cells. All clones 
have P values < 107°. b, c, -SNE map of fish R2 and R3 cells showing 
different colours for enriched clones detected in glia cells, neurons or 
retinal interneurons. Other cells are shown in grey. d, e, Scar percentage 
per cell for clones found in the WKM, forebrain, hindbrain, left and right 
eyes, and left and right midbrain for R2 and R3. In all panels, each colour 
represents the same scar (for example, the yellow scar is the same for R2 
and R3), and unscarred GFP is shown in green. f, g, Heat maps of the 
fraction of cells per clones for each organ for R2 (f) and R3 (g). Enriched 
(magenta upwards triangles) and depleted (blue downwards triangles) 


LETTER 


scar clones per organ are determined by a two-sided Fisher's exact test 
with P< 0.05. The bar above each panel depicts the number of cells and 

P value for each clone. h, Histograms of the relative clone frequency in 

the left (blue) and right (red) midbrain and eye for R3. i, Image of dome- 
stage embryo injected with Cas9-eScarlet in one cell at the two-cell stage 
(n> 10 embryos showed similar patterns). BF, bright-field. j, Scarring 
efficiency shown as the percentage of unscarred GFP in S1, P1 and P2 for 
the left and right eyes, and the WKM. k, t-SNE map of cells isolated from 
the left and right eyes of S1, in which cells are coloured according to their 
cell type. 1, Heat map of the fraction of cells per clones for each cell type in 
S1. Enriched (magenta upwards triangle) and depleted (blue downwards 
triangle) scar clones per cell type are determined from a two-sided Fisher's 
exact test with P< 0.05. The bars at the top depict the total number of cells 
and the fraction of cells found in the right eye for each clone. All P values 
are below 10-°. 
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Extended Data Figure 5 | See next page for caption. 
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Extended Data Figure 5 | Clones for fish P1 and P2 and lineage tree 

of the eyes and midbrain. a, b, Scar percentage per cell for clones found 
in the WKM, forebrain, hindbrain, left and right eyes, and left and right 
midbrain for P1 (a) and P2 (b). Each colour represents a different scar, in 
which unscarred GFP is always shown in green. Colour legend per scars is 
different between panels (yellow scar in a is not yellow scar in b). c, d, Heat 
maps of the fraction of cells per clones for each organ for P1 (c) and P2 
(d). Enriched (magenta upwards triangles) and depleted (blue downwards 
triangles) scar clones per organ are determined from a two-sided Fisher's 
exact test with P< 0.05. The bar above each panel depicts the number of 
cells and P value for each clone. e-h, Lineage trees obtained assuming 

the principle of maximum parsimony as described in Supplementary 
Information section 5 for clones detected in the right and the left eyes 

(e, g) and right and left midbrain (f, h) of fish P1 (e, f) and R2 (g, h). 

The root of the trees is set as an unscarred clone, with eight copies of the 
GFP transgene. In the tips there are the detected clones. The statistical 
confidence of each branch is computed as the proportion of each 

branch among 10,000 tree replicates constructed by bootstrapping scars 
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present in all clones. To assess statistically whether clones from the left 
or the right side co-evolve together, we randomized the clones at the 
tips of the tree and checked how many times, randomly, clones from the 
right or the left were found to be sisters with other clones from the 

right or the left. This allowed us to build a distribution of co-evolution 
(histograms in each tree) of clones for the null hypothesis and check 
whether the number of times we saw clones from one side together was 
statistically significant or not. The vertical dashed line in each 
histogram indicates the number of times we see clones from one side 
together as sisters in the reference tree. When such line is found at 

the right-hand (left-hand) side of the maximum, we assume that the 
coevolution of the clones is enhanced (depleted). In the heat maps, we 
indicate the degree of co-evolution of clones in the right or the left eye 
or midbrain, computed as the fraction of the area of the histogram at the 
right- or the left-hand side (that is, enhanced or depleted co-evolution, 
respectively) of the vertical line divided by the corresponding area 

of the histogram at the right- or left-hand side of maximum of the 
distribution. 
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Extended Data Figure 6 | Transcriptome analysis of the zebrafish caudal A complete list of marker genes used for each cell type is available in 


fin. a, t-SNE maps obtained by pooling together cells from primary and Supplementary Table 5. b, t-SNE maps for the caudal fin of R4, R5 and R6 
regenerated fins from fish R4, R5 and R6. In each panel, single cells are coloured based on fish (left) and fin version (right) of origin. All cells are 
coloured according to the number of unique transcripts observed for a present in all fins and fin version. No batch effects are observed. 


given gene. The corresponding cell type is indicated in parenthesis‘. 
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Extended Data Figure 7 | Clonal analysis in the caudal fin. a, Scar 
percentage per cell in clones detected in fish R4 (left), R5 (middle) and R6 
(right). The corresponding organ is indicated above each barplot (WKM 
or fin version). Spatial information (dorsal or ventral) is indicated when 
available. The bars at the top indicate clones. Each colour represent a 

scar, the same colour scheme is used for all panels. b, c, Heat maps of the 
fraction of cells per clones for each cell type and fin in fish R5 (b) and R6 
(c). Enriched (magenta upwards triangle) and depleted (blue downwards 
triangle) scar clones per cell type per primary, secondary and tertiary fin 
obtained from two-sided Fisher’s exact test with P < 0.05. Top bars depict 
clones found in the WKM of the same fish, the corresponding number 

of cells, and the P value for each clone. d, t-SNE map of R6, in which 

cells with clone 24 (as a representative example of clones shared between 
mesenchymal and epidermal cells) are highlighted in red. e, t-SNE map 


of all cells detected in the caudal fin, in which cells from clone 19 (as 

a representative example of clones shared between mesenchymal and 
epidermal cells) in R4 are highlighted in red. f, t-SNE map of R6 primary 
(left) and regenerated (right) caudal fin cells (grey circles), in which 

cells from osteoblast clones are highlighted in red. Dashed lines represent 
mesenchymal cells (Fig. 4b, Extended Data Fig. 6). The percentages indicate 
the fraction of mesenchymal cells that share clones with osteoblasts. g, 
Magnified view of the t-SNE maps of R4, R5 and R6 for immune cells (dashed 
line on Fig. 4b). Cells are coloured based on fish (left) and fin version (right) 
of origin. Subpopulations of lymphoid (dashed circles) and myeloid (solid 
circles) are found in all fish and fin versions. h, Scar percentage for cells 
detected in the WKM (top) and RICs (bottom) for R4 (left), R5 (middle) and 
R6 (right) in the primary fins reveals the absence of common scars between 
the two. The bar above each panel indicates the different clones. 
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Extended Data Figure 8 | See next page for caption. 
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Extended Data Figure 8 | The histone-GFP transgene and scar 
characterization. a, Scheme of one copy of the h2afva:GFP transgene as 
previously described'’. b, Copy number of the transgene. Top, average 
number of reads in bin sizes of 1 kb and sliding window of 200 bp obtained 
in whole-genome sequencing data. Bottom, copy number extracted using 
FREEC-11.0 with default parameters (Methods). c, Number of clones 
detected with a given number of scars, obtained by pooling all data 

from all fish used in this study. d, Scar percentage per cell in a clone in 
which seven different scars are detected. e, Probability density function 
(normalized histogram) of the fraction of scars detected in the clone 
depicted in d (colour code as in d). f, Average fraction of a scar per embryo 
computed over ten independently injected embryos (for times larger than 
6h) and over three pools of ten embryos (for times lower or equal to 6h) 
detected from gDNA as a function of time for Cas9 RNA injections, for 
the six most observed scars (described with CIGAR strings). Error bars 
denote s.e.m. Solid green lines are the fit to equation (5) in Supplementary 
Information section 2. g, Top 100 observed scars, sorted according to their 
probability, and corresponding position of deletions (red) and insertions 
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(blue) along the GFP coordinate. h, Scar probabilities as a function of 
sorted scars. Error bars denote s.e.m. from the fit (in f). i, Probabilities 
of measuring the percentages x; for three scars with copy numbers 2 
(top), 1 (middle) and 1 (bottom), in which the expected percentages are 
i= 50%, 25% and 25%, respectively, present in the same cell with four 
surviving integrations of GFP. The probability has been obtained by 
independently simulating 1,000 times the ScarTrace protocol for ¢ = 0.50 
(left) and ¢ =0.85 (right). Solid green lines are the fit to equation (7) in 
Supplementary Information section 3. j, Scar percentage for cells from 
the same clone made of three scars, in which each scar is represented 
with a different colour. k, Corresponding probability density functions 
(normalized histogram) for the fraction of each scar per cell (colour 
code as in j). Black lines denote the best fit for each scar to equation 

(7) (see Supplementary Information). 1, Violin plot showing the 
distribution of measured P values obtained using a Gaussian kernel 
density with bandwidth determined using the Scott method™, for all 
clones with a given estimated Ngrp. Labels indicate the number of clones 
observed for Ncpp. 
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Extended Data Table 1 | Clone number in different organs 

WKM | Left eye | Right eye | Forebrain | Hindbrain | Left midbrain | Right midbran | Fin 

Rl 14 - - - - - - - 

R2 7 18 14 34 17 15 10 - 

< | R3 - 6 1. 20 16 14 22 - 
2lr4} 10 ‘ : ‘ : : : 25 
R5 3 - - - - - - 17 

R6 8 - - - - - - 39 

5 Pl 4 5 5 2 5) 6 7 4 

6, | P2 3 1 3 2 - 1 1 2 


Number of clones in different organs for different fish, for the two different Cas9 delivery strategies (RNA or protein). Unscarred clones are excluded and only clones with two or more ce 


considered. Whenever clones from the WKM are known for a fish (R1, R2, R4, R5, R6, Pl or P2), they are excluded from the clones detected in the other organs. 
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Extended Data Table 2 | Overview of the scarred fish and organs dissected for each 


fish Cas9 age organ cells 
(months) Scars Transcripts Combined 
Pl protein 18 WKM 286 
forebrain 209 
hindbrain 123 
left eye 98 
right eye 106 
left midbrain 95 
right midbrain 152 
c. fin (1) 186 
P2_ protein 18 WKM 544 
forebrain 175 
left eye 44 
right eye 45 
left midbrain 167 
right midbrain 151 
c. fin (1) 339 
Rl RNA 18 WKM 1738 1784 951 
R2 RNA 3 WKM 1673 1397 783 
forebrain 545 563 409 
hindbrain 133 179 64 
left eye 303 403 208 
right eye 95 250 78 
left midbrain 429 302 190 
right midbrain 238 135 78 
R3 RNA 3 forebrain 408 363 206 
hindbrain 152 
left eye 82 
right eye 204 336 171 
left midbrain 463 126 77 
right midbrain 458 287 186 
R4 RNA 18 WKM 625 
c. fin (1) 979 926 A79 
c. fin (2) 893 1148 538 
dorsal c. fin (3) 504 694 329 
ventral c. fin (3) 687 582 292 
R5 RNA 18 WKM 66 
c. fin (1) 1090 757 393 
c. fin (2) 1235 1022 673 
dorsal c. fin (3) 103 348 63 
ventral c. fin (3) 96 169 24 
R6 RNA 18 WKM 432 
dorsal c.fin (1) 318 576 135 
ventral c.fin (1) 205 538 66 
dorsal c.fin (2) 378 697 195 
ventral c.fin (2) 391 714 199 
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Number of cells sequenced per organ that survive filtering thresholds for transcriptome libraries, scars libraries and combined per scarred fish. The total number of sorted plates is indicated 
in parenthesis. In the caudal fin (c. fin), primary (1), secondary (2) or tertiary (3) are also indicated in parenthesis. 
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Itaconate is an anti-inflammatory metabolite that 
activates Nrf2 via alkylation of KEAP1 


Evanna L. Mills!?*4*, Dylan G. Ryan!*, Hiran A. Prag®, Dina Dikovskaya®, Deepthi Menon!, Zbigniew Zaslonal, 

Mark P. Jedrychowski*, AnaS. H. Costa’, Maureen Higgins®, Emily Hams®, John Szpyt*, Marah C. Runtsch!, Martin S. King’, 
Joanna F. McGouran’, Roman Fischer!, Benedikt M. Kessler!, Anne F. McGettrick!, Mark M. Hughes!, Richard G. Carroll, 
Lee M. Booty**, Elena V. Knatko®, Paul J. Meakin!', Michael L. J. Ashford'!!, Louise K. Modis‘, Gino Brunori!’, Daniel C. Sévin', 
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Albena T. Dinkova-Kostova®, Richard C. Hartley'*, Michael P Murphy°§ & Luke A. O’Neill>4s 


The endogenous metabolite itaconate has recently emerged as a 
regulator of macrophage function, but its precise mechanism of 
action remains poorly understood! °. Here we show that itaconate 
is required for the activation of the anti-inflammatory transcription 
factor Nrf2 (also known as NFE212) by lipopolysaccharide in mouse 
and human macrophages. We find that itaconate directly modifies 
proteins via alkylation of cysteine residues. Itaconate alkylates 
cysteine residues 151, 257, 288, 273 and 297 on the protein KEAP1, 
enabling Nrf2 to increase the expression of downstream genes with 
anti-oxidant and anti-inflammatory capacities. The activation of 
Nrf2 is required for the anti-inflammatory action of itaconate. We 
describe the use of a new cell-permeable itaconate derivative, 4-octyl 
itaconate, which is protective against lipopolysaccharide-induced 
lethality in vivo and decreases cytokine production. We show that 
type I interferons boost the expression of Irg1 (also known as Acod1) 
and itaconate production. Furthermore, we find that itaconate 
production limits the type I interferon response, indicating a 
negative feedback loop that involves interferons and itaconate. Our 
findings demonstrate that itaconate is a crucial anti-inflammatory 
metabolite that acts via Nrf2 to limit inflammation and modulate 
type | interferons. 

Macrophages have a key role in innate immunity. They respond 
rapidly to pathogens and subsequently promote an anti-inflammatory 
phenotype to limit damage and promote tissue repair. The factors driving 
these changes are incompletely understood. Itaconate, a metabolite 
synthesized by the enzyme encoded by Irg!', is increased in lipopoly- 
saccharide (LPS)-activated macrophages’ and has been suggested to 
limit inflammation by inhibiting succinate dehydrogenase (SDH), 
a crucial pro-inflammatory regulator’; however, the details remain 
unclear. 

Itaconate was the most abundant metabolite in LPS-treated human 
macrophages (Fig. la) and reached 5mM in mouse bone marrow- 
derived macrophages (BMDMs) after LPS stimulation (Fig. 1b, c). 
Itaconate can disrupt SDH activity, but is less potent than the classic 
SDH inhibitor malonate (Extended Data Fig. 1), suggesting that it may 
exert its anti-inflammatory effects via additional mechanisms. 

Itaconate contains an electrophilic a,3-unsaturated carboxylic acid 
that could potentially alkylate protein cysteine residues by a Michael 
addition to form a 2,3-dicarboxypropyl adduct. An attractive candidate 


protein that undergoes cysteine alkylation is KEAP1, a central player in 
the anti-oxidant response (Fig. 1d). KEAP1 normally associates with 
and promotes the degradation of Nrf2, but alkylation of crucial KEAP1 
cysteine residues allows newly synthesized Nrf2 to accumulate, migrate 
to the nucleus and activate a transcriptional anti-oxidant and anti- 
inflammatory program’. We therefore examined KEAP1 and Nrf2 as 
targets of itaconate. 

The cell-permeable itaconate derivative dimethyl itaconate (DMI)? 
boosted levels of Nrf2 protein, expression of downstream target genes, 
including Hmox1, and glutathione (GSH) (Extended Data Fig. 2a-d). 
However, the lack of a negative charge on the conjugated ester group 
in DMI increases its reactivity towards Michael addition, making it a 
far superior Nrf2 activator than itaconate akin to the potent Nrf2 acti- 
vator dimethylfumarate (DMF)®. DMI is rapidly degraded within cells 
without releasing itaconate’, hence is unlikely to mimic endogenous 
itaconate. Even so, these data indicate that Nrf2 activation is anti- 
inflammatory® (Extended Data Fig. 2e, f). 

To overcome the limitations of DMI, we synthesized 4-octyl itac- 
onate (OI), a cell-permeable itaconate derivative (Extended Data 
Fig. 3a). Itaconate and OI had similar thiol reactivity that was far lower 
than that of DMI (Extended Data Fig. 3b, c, f), making it a suitable 
cell-permeable itaconate surrogate. Furthermore, OI was hydrolysed 
to itaconate by esterases in mouse myoblast C2C12 cells (Extended 
Data Fig. 3d) and LPS-activated mouse macrophages (Extended Data 
Fig. 3e). OI boosted Nrf2 levels (Fig. le, compare lane 5 to lane 1) 
and enhanced LPS-induced Nrf2 stabilization (Fig. le, compare lane 6 
to lane 2), and increased the expression of downstream target genes’, 
including the anti-inflammatory protein HMOX1"° (Fig. 1f, g). We used 
a quantitative NAD(P)H:quinone oxidoreductase-1 (NQO1) inducer 
bioassay’, to assess the potency of Nrf2 activation by the CD value 
(concentration required to double the specific enzyme activity) for 
NQOl1, the prototypical Nrf2 target gene. OI (CD value of 2|1M), was 
more potent than the clinically used Nrf2 activator DMF (CD value 
of 6.5 |1M) (Fig. 1h, Extended Data Fig. 3f). OI stimulated synthesis of 
the key anti-oxidant GSH (Extended Data Fig. 3g-i). OI also boosted 
canonical activation of Nrf2 by the pro-oxidant hydrogen peroxide 
(H2O2) (Extended Data Fig. 3), k). Importantly, the related octyl esters 
4-octyl 2-methylsuccinate and octyl succinate, which are not Michael 
acceptors, had no effect on Nrf2 activity, confirming the requirement 
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for the itaconate moiety (Extended Data Fig. 31). Dimethyl malonate, 
a potent SDH inhibitor‘, did not activate Nrf2 (Extended Data 
Fig. 3m), confirming that Nrf2 activation by OI is independent of SDH 


inhibition. 


Itaconate is generated by IRG1 in the mitochondrial matrix and must 
cross the mitochondrial inner membrane to act on Nrf2 in the cytosol. 


Sa 
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Concentration (4M) 


shows mean. For gel source data, see 
Supplementary Fig. 1. 


Itaconate is structurally similar to malate, which is transported across 
the mitochondrial inner membrane by the dicarboxylate, citrate and 
oxoglutarate carriers. All three carriers transported itaconate, whereas 


other tested carriers could not (Fig. 2a and Extended Data Fig. 4), sug- 
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gesting that LPS-induced itaconate is generated in the mitochondrial 
matrix and is then exported to the cytosol to activate Nrf2. 


Figure 2 | Itaconate alkylates cysteines. 

a, Itaconate transport by the indicated 

carriers(n = 4). H3AAC1, Homo sapiens ADP/ 
ATP carrier; HsCTP, H. sapiens citrate carrier; 
HsODC, H. sapiens oxodicarboxylate carrier; 
HsOGC, H. sapiens 2-oxoglutarate carrier; 
ScAAC2, Saccharomyces cerevisiae ADP/ATP 
carrier; ScDIC, S. cerevisiae dicarboxylate carrier. 
b, Nrf2 and KEAP1 protein after co-transfection 
with Nrf2-V5, and the wild-type (WT) or 
Cys151Ser mutant KEAP1. c, Tandem mass 
spectrometry spectrum of Cys151-containing 
KEAP1 peptide after OI treatment. d, Metabolite 
(3C6-glucose (left), '*C;-glutamine (right)) 
tracing to itaconate-cysteine adduct with or 
without LPS (24h, n=5). AU, arbitrary units. 

e, LDHA Cys84 alkylation plus LPS (24h) or OI 
(250M, 4h) (n=4). Data are mean +s.e.m. 

(in d, e) or s.d. (in a). P values calculated using 
one-way ANOVA for multiple comparisons 

or two-tailed Student's t-test for paired 
comparisons. Blots are representative of three 
independent experiments. For gel source data, 
see Supplementary Fig. 1. 
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Figure 3 | OI limits IL-16 in an Nrf2-dependent manner and protects 
against LPS lethality. a—d, LPS (24h) induced I]1b mRNA (a, n= 3), 
IL-18 and HIF-1a protein (b), ROS production (c, n = 3; measured as 

the percentage change in mean fluorescent intensity (MFI) relative to 
untreated control) and nitrite production (d, n=3) + OL. e, LIB mRNA 
in human PBMCs treated as in a-d (n = 3). f, Survival (left), clinical score 
(middle) and body temperature (right) measurements in mice (n= 10) 
injected intraperitoneally with OI (50 mg kg, 2h) and LPS (15 mg kg’). 


Our hypothesis is that itaconate activates Nrf2 by alkylation of 
KEAPI cysteine residue(s)'*~’°, similar to the modification of cysteines 
by fumarate (Extended Data Fig. 5a). Cysteine 151 (Cys151) is a prin- 
cipal sensor on KEAP1 for sulforaphane!® and DMF". OI stabilized 
V5-tagged Nrf2 (Nrf2-V5) in COS1 cells co-expressing wild-type 
KEAP1 but not a Cys151Ser mutant, similarly to sulforaphane (Fig. 2b, 
compare lanes 16 and 17 to lanes 18 and 19). To analyse KEAP1 alky- 
lation directly, we overexpressed Myc-DDK-tagged KEAP1 in human 
HEK293T cells and treated the cells with OI. Tandem mass spectro- 
metry of immunoprecipitated KEAP1 revealed that for the KEAP1 pep- 
tide (144-152), which contains Cys 151, OI treatment increased its mass 
by 242.15 Da, consistent with alkylation by OI (Fig. 2c). Ol also modi- 
fied other known KEAP1 regulatory cysteine residues (Cys257, Cys288 
and Cys273) (Extended Data Fig. 5b-d, Extended Data Table 1a). 
Furthermore, itaconate-cysteine adducts, derived in part from glucose 
and glutamine (Fig. 2d and Extended Data Fig. 6), were detected in 
LPS-treated macrophages. These data suggest that itaconate activates 
Nrf2 by alkylating KEAP1 cysteine residues. We further explored 
cysteine alkylation induced by itaconate using an untargeted mass 
spectrometry approach in macrophages treated with OI, or with LPS, 
which increases itaconate levels. We identified several proteins that 
contain alkylated cysteine residues (Extended Data Table 1b, c). Notably 
LDHA, which has a crucial role in the regulation of glycolysis, was alky- 
lated in OI- and LPS-treated macrophages (Fig. 2e and Extended Data 
Fig. 5e, f). This modification, here defined as 2,3-dicarboxypropylation, 
generates a stable thioether. As there are no known pathways for the 
removal of such post-translational modifications, modified proteins 
are probably degraded, suggesting that this modification will have pro- 
found effects on macrophage function. 

We next assessed whether itaconate activation of Nrf2 could be 
anti-inflammatory. OI, used at concentrations that did not affect 
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g, Serum IL-1 and TNF levels from mice injected intraperitoneally 

with OI (50mg kg~!, 2h) and/or LPS (2.5 mg kg™!, 2h, n =3 vehicle, OI; 
n=15 LPS, OI plus LPS). h, Nrf2, HMOX1 and IL-16 protein in wild-type 
and Nrf2 knockout (KO) mouse BMDMs treated with LPS (6h) and OI 

as indicated. Data are mean +s.e.m. P values calculated using one-way 
ANOVA. Blots are representative of three independent experiments. For 
gel source data, see Supplementary Fig. 1. 


cellular viability, decreased LPS-induced I/1b mRNA, pro-IL-1{, 
HIF-1aand IL-10 protein levels, and decreased the extracellular acidi- 
fication rate, yet had no effect on NF-KB activity or TNF (also known 
as TNFa) levels (Fig. 3a, b and Extended Data Fig. 7a-f). OI also 
decreased I/1b mRNA in BMDMs treated with the TLR2 and TLR3 
ligands, Pam3CSK and polyinosinic:polycytidylic acid (poly(I:C)), 
respectively (Extended Data Fig. 7g). Levels of LPS-induced reac- 
tive oxygen species (ROS), nitrite and inducible nitric oxide syn- 
thase (INOS) were limited by OI (Fig. 3c, d and Extended Data Fig. 
7h, i). These effects are likely to be a consequence of ROS detoxi- 
fication after Nrf2 induction by OI. IL-1 and TNF were decreased 
by OI in human peripheral blood mononuclear cell (PBMCs) (Fig. 3e, 
Extended Data Fig. 7j). OI also counteracted the pro-inflammatory 
response to LPS in vivo. OI, which activated Nrf2 (Extended Data 
Fig. 7k), prolonged survival, decreased clinical score and improved 
body temperature regulation, and decreased IL-18 and TNF levels 
but not IL-10 in an LPS model of sepsis (Fig. 3f, g and Extended Data 
Fig. 71). 

OI induction of HMOX1] was blocked in Nrf2-deficient macrophages 
(Fig. 3h (compare lanes 2 and 3 to lanes 8 and 9) and Extended Data 
Fig. 8a, d) or when Nrf2 was silenced (Extended Data Fig. 8a, d (com- 
pare lanes 7 and 8 to lanes 11 and 12)). Without Nrf2, the decrease in 
LPS-induced IL-1 with OI was significantly impaired (Fig. 3h (com- 
pare lane 6 to lane 12), Extended Data Fig. 8b-f (compare lanes 6 and 8 
to 10 and 12 inc, d)). Furthermore, two Nrf2 activators, diethyl maleate 
and 15-deoxy-A12,14-prostaglandin J2 decreased LPS-induced IL-1, 
IL-10, nitric oxide synthase (NOS2) and nitrite (Extended Data 
Fig. 8g-k). Thus, itaconate activates an anti-inflammatory program 
through Nrf2. 

We next investigated how switching from a pro- to an anti-inflammatory 
state might affect itaconate production from aconitate by IRGI. 
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Figure 4 | A feedback loop exists between itaconate and IFN-(. 

a, Metabolite levels in control versus IFN-8-treated (1,000 U ml7!; 27h; 
n=5) mouse macrophages. b, LPS-induced (24h) Irg] expression + IFN-8 
(1,000 U ml~}; n=3). ¢, Irg1 expression in wild-type and IEN receptor- 
deficient (Ifnar1 ~/~) BMDMs plus LPS or poly(I:C) (40g ml~') for 24h 
(n=3). d, IFN-B (n =3) expression plus LPS (24h) and OI as indicated. 

e, ISG15 and IKK-e expression after treatment with LPS (24h) and OI. 

f, IFN-( protein expression in PBMCs treated with poly(I:C) (20g ml}; 
24h) and OI (n = 3) as indicated. g, Serum IFN-6 levels from mice 
injected intraperitoneally with OI (50 mg kg~!, 2h) with or without LPS 
(2.5mg kg~!; 2h) (n=3 vehicle, OI; n=15 LPS, Ol and LPS). h, The anti- 
inflammatory role of itaconate. Data are mean +s.e.m. P values calculated 
using one-way ANOVA. Blots are representative of three independent 
experiments. Data in f are representative from one of two human donors. 
For gel source data, see Supplementary Fig. 1. 


By modelling gene networks that control Irg] expression, the IFN 
response factor IRF1 was identified as a regulator'’. We show here 
that itaconate levels are increased after IFN-6 treatment (Fig. 4a), in 
agreement with others!’. Levels of citrate and aconitate, the substrate 
for Irg1, were reduced by IFN-B as was the downstream metabolite 
a-ketoglutarate (Extended Data Fig. 9a). These data are consistent 
with an increase in aconitate conversion to itaconate rather than 
a-ketoglutarate. IFN-3 enhanced basal and LPS-induced Irg1 expres- 
sion (Fig. 4b). LPS- and poly(I:C)-induced Irg1 expression in BMDMs 
lacking type I IFN receptor was decreased (Fig. 4c), indicating that 
autocrine IFN facilitates IRG1 induction. OI limited the IFN response, 
decreasing the expression of IFN-3, IKK-e, ISG20 and ISG15 protein, 
IFN-( production in poly(I:C)-treated PBMCs and LPS-induced IFN-8 
production in vivo (Fig. 4d-g and Extended Data Fig. 9b, c). IFN-8 
enhanced both the mRNA and protein expression of IL-10, with or 
without the addition LPS (Extended Data Fig. 9d), suggesting that the 
decrease in IL-10 after OI treatment is due to reduced type I IFN pro- 
duction”®. Nrf2 knockout or knockdown attenuated the reduction of 
ISG20 expression by OI, whereas the Nrf2 activators diethyl maleate 
and 15-deoxy-A12,14-prostaglandin J2 reduced ISG20 expression 
(Extended Data Fig. 9e-g). This agrees with increased expression of 
IRF3-regulated genes in LPS-treated Nrf2-deficient mice!. 

These data suggest the operation of a negative-feedback loop: itaco- 
nate is generated in response to LPS, in part through type I IFNs, 
and promotes an anti-inflammatory program by Nrf2 activation 
(Fig. 4h), as well as SDH inhibition*”. This limits further inflammatory 
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gene expression and its own production by downregulating the IFN 
response. This helps to explain why Nrf2-deficient mice are more sen- 
sitive to septic shock”!, even though under certain circumstances these 
mice are protected from inflammation”’. Our identification of itaconate 
as an inflammatory regulator, that directly modifies proteins through 
a newly identified post-translational modification, unveils therapeutic 
opportunities to use itaconate or OI to treat inflammatory diseases”*. 
Furthermore, an intriguing link was recently made” from itaconate to 
vitamin Bj, and this warrants further investigation in the context of 
inflammation and immunity. Further understanding the role of itaco- 
nate as an anti-inflammatory metabolite and regulator of type I IFNs 
is likely to yield new insights into the pathogenesis of inflammatory 
diseases. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Isolation of human PBMCs. Human PBMCs were isolated from human blood 
using Lymphoprep (Axis-Shield). Whole blood (30 ml) was layered on 20 ml 
lymphoprep and spun for 20 min at 2,000 r.p.m. with no brake on. The PBMCs 
were isolated from the middle layer. PBMCs were maintained in RPMI supple- 
mented with 10% (v/v) FCS, 2mM t-glutamine, and 1% penicillin/streptomycin 
solution. 

Generation of human macrophages. Blood was layered on Histopaque and 
centrifuged at 800g for 20 min, acceleration 9, and deceleration at 4. The PBMC 
layer was isolated and the macrophages were sorted using magnetic-activated cell 
sorting (MACS) CD14 beads. Cells were plated at 0.5 x 10° cells ml~! in media 
containing M-CSF (100 ng ml!) and maintained at 37 °C, 5% CO; for 5 days, to 
allow differentiation into macrophages. For further details, see Supplementary 
Methods. 

Generation and treatment of BMDMs. Mice were euthanized in a CO cham- 
ber and death was confirmed by cervical dislocation. Bone marrow cells were 
extracted from the leg bones and differentiated in DMEM (containing 10% fetal 
calf serum, 1% penicillin/streptomycin and 20% L929 supernatant) for 6 days, at 
which time they were counted and replated for experiments. Unless stated, 5 x 10° 
BMDMs per millilitre were used in in vitro experiments. Unless stated, the LPS 
concentration used was 100 ng ml-!, the DMI and OI concentration was 125 uM, 
and in experiments where pre-treatments occurred before LPS stimulation this 
was for 3h. 

Synthesis of itaconate compounds. For details on synthesis and characterization 
of chemical compounds, see Supplementary Methods. 

Metabolomic analysis with Metabolon. Macrophages were plated at 2 x 10° 
per well in 6-well plates and treated as required. BMDMs n=5, human mac- 
rophages n= 12. Analysis was performed by Metabolon. For further details, 
see Supplementary Methods. 

Metabolite measurements for absolute succinate and itaconate quantifica- 
tion and metabolite tracing. Cells were treated as desired. For tracing studies, 
immediately before LPS stimulation, the media was removed and replaced with 
DMEM media (1 ml) containing U-'3C-glucose (4.5 g 1~!) or U-/8C-glutamine 
(584mg ml ') deplete of '?C- glucose or '*C-glutamine. Samples were extracted in 
methanol/acetonitrile/water, 50:30:20 (v/v/v) (1 ml per 1 x 10° cells) and agitated 
for 15 min at 4°C in a Thermomixer and then incubated at —20°C for 1h. Samples 
were centrifuged at maximum speed for 10 min at 4°C. The supernatant was trans- 
ferred into a new tube and centrifuged again at maximum speed for 10 min at 4°C. 
The supernatant was transferred autosampler vials. Liquid chromatograph-mass 
spectrometry (LC-MS) analysis was performed using a Q Exactive mass spectrom- 
eter coupled to a Dionex U3000 UHPLC system (Thermo). For further details, see 
Supplementary Methods. 

Western blotting. Protein samples from cultured cells were prepared by direct lysis 
of cells in 5x Laemmli sample buffer, followed by heating at 95°C for 5 min. For 
spleen samples, 30 mg of spleen was homogenized in RIPA buffer using the Qiagen 
TissueLyserII system. The resulting homogenate was centrifuged at 14,000 r.p.m. 
for 10 min at 4°C, and supernatants were used for SDS-PAGE. Protein samples 
were resolved on 8% or 12% SDS-PAGE gels and were then transferred onto 
polyvinylidene difluoride (PVDF) membrane using either a wet or semi-dry transfer 
system. Membranes were blocked in 5% (w/v) dried milk in TBS-Tween (TBST) 
for at least 1h at room temperature. Membranes were incubated with primary 
antibody, followed by the appropriate horseradish peroxidase-conjugated second- 
ary antibody. They were developed using LumiGLO enhanced chemiluminescent 
(ECL) substrate (Cell Signalling). Bands were visualized using the GelDoc system 
(Biorad). 

Quantitative PCR. Total RNA was isolated using the RNeasy Plus Mini kit 
(Qiagen) and quantified using a Nanodrop 2000 UV-visible spectrophotometer. 
cDNA was prepared using 20-100 ng,il ! total RNA by a reverse transcription 
PCR (RT-PCR) using a high capacity cDNA reverse transcription kit (Applied 
Biosystems), according to the manufacturer’s instructions. Quantitative PCR 
(qPCR) was performed on cDNA using SYBR Green probes. qPCR was performed 
ona 7900 HT Fast Real-Time PCR System (Applied Biosystems) using Kapa fast 
master mix high ROX (Kapa Biosystems, for SYBR probes) or 2 PCR fast master 
mix (Applied Biosystems, for Taqman probes). For SYBR primer pair sequences, 
see Supplementary Methods. Fold changes in expression were calculated by the 
AAC, method using mouse Rps18 as an endogenous control for mRNA expression. 
All fold changes are expressed normalized to the untreated control. 

NQO1 bioassay. Inducer potency was quantified by use of the NQO1 bioassay in 
Hepalclc7 mouse hepatoma cells'»!”, Cells (10 per well of a 96-well plate) were 
grown for 24h and exposed (n= 8) to serial dilutions of compounds for 48h before 
lysis. NQO1 enzyme activity was quantified in cell lysates using menadione as a 
substrate. Protein concentrations were determined in aliquots from the same cell 


lysates by the bicinchoninic acid (BCA) assay (Thermo Scientific). The CD value 
was used as a measure of inducer potency. For assays examining the effect of GSH 
on inducer potency, 501M of each compound was incubated with 1 mM GSH in 
the cell culture medium at 37 °C for 30 min before treatment. 
Preparation of rat liver mitochondria. Female Wistar rats aged between 10 and 
12 weeks (Charles River) were culled by stunning and cervical dislocation before 
the liver being excised and stored in ice-cold buffer (STE buffer; 250mM sucrose, 
5mM Tris-Cl, 1 mM EGTA (pH 7.4 at 4°C)). Rat liver mitochondria were iso- 
lated by homogenization and differential centrifugation at 4°C in STE buffer”®. In 
brief, minced tissue was homogenized in STE buffer before centrifugation (1,000g, 
3 min, 4°C) and centrifuging the resulting supernatant (10,000g, 10 min, 4°C). The 
mitochondrial pellet was resuspended in fresh STE before centrifuging (10,000g, 
10 min, 4°C). The resulting pellet was resuspended in STE and assayed for protein 
concentration via BCA assay (Thermo Scientific) against a BSA standard curve. 
Preparation of bovine heart mitochondrial membranes. Bovine heart mitochon- 
dria were isolated by differential centrifugation in 250mM sucrose, 10 mM Tris-Cl, 
0.2mM EDTA (pH 7.8 at 4°C). To prepare membranes, bovine heart mitochondria 
were blended with MilliQ water at 4°C before adding KC] to a final concentra- 
tion of 150 mM and blending until homogenous. The suspension was centrifuged 
(13,500g, 40 min, 4°C) and the pellet was resuspended in re-suspension buffer 
(20 mM Tris-Cl, 1mM EDTA, 10% glycerol, pH 7.55 at 4°C) before homogeniza- 
tion and assaying for protein by BCA assay (Thermo Scientific)”. 
Measuring complex II and III activity. Bovine heart mitochondrial membranes 
(801g protein per ml) were incubated in 50mM potassium phosphate buffer 
(50mM potassium phosphate, 1 mM EDTA, pH 7.4, 4°C) supplemented with 
3mM KCN, 4M rotenone and succinate. In a 96-well microplate, inhibitor or 
vehicle control and membrane incubation were plated and incubated for 10 min 
at 30°C. Alternatively, where indicated, itaconate was incubated with membranes 
and removed by twice centrifuging membranes and resuspending in non-itaconate 
containing buffer, before plating with 1 mM succinate. Oxidized cytochrome-c was 
added before measuring the respiratory chain activity by assessing the reduction 
of cytochrome-c spectrophotometrically at 550 nm at 20s intervals for 5 min at 
30°C. Final concentrations were 101g protein per well bovine heart membranes 
and 30M ferricytochrome c. 
Measuring rat liver mitochondrial respiration. Respiration of rat liver mitochon- 
dria was assessed with an Oxygraph-2K (OROBOROS instruments high resolution 
respirometry). Rat liver mitochondria (0.5 mg mitochondrial protein per ml) were 
added to KCl buffer (pH 7.2, 37°C) and respiration assessed in the presence of 
4g ml! rotenone, 1 mM succinate, 11M FCCP and inhibitors or buffer control. 
Assessing itaconate ester reactivity with glutathione. GSH (1 or 5mM) and 
5 mM itaconate esters or vehicle control were incubated in KCl buffer (pH 7.2 
or 8) at 37°C for 2h, where indicated, 101g recombinant GST was added to the 
incubation. The reaction was stopped by acidification with 5% sulfosalicylic acid 
before assessing glutathione content by the GSH recycling assay as described 
previously”®. 
Itaconate transport assays. Itaconate transport by mitochondrial carriers was 
assessed as described previously”*. For further details see Supplementary Methods. 
Cell uptake of itaconate. C2C12 mouse myoblasts were plated at 300,000 cells 
per well in a 6-well plate in complete growth medium and adhered overnight in a 
humidified 5% CO3, 37°C incubator. The following day, media was replaced with 
serum-free DMEM containing itaconate esters and cells were treated for 30 min 
at 37°C. Cells were extracted as described above (method for succinate quantifi- 
cation), with MS internal standard (100 pmol) added and stored at —80°C before 
LC-MS/MS analysis. For further details, see Supplementary Methods. 
LC-MS/MS analysis was performed using an LCMS-8060 mass spectrometer 
(Shimadzu) with a Nexera X2 UHPLC system (Shimadzu). For further details, see 
Supplementary Methods. 
KEAPI cysteine target validation. COS1 cells (2.5 x 10° per well) in 6-well plates 
were co-transfected (Lipofectamine 2000) with 0.8 1g of Nrf2-V5 and 1.6g of 
wild-type or Cys151S mutant KEAP1", or 1.6 jg of pcDNA. Cells were grown for 
21h then treated with 20 or 100j1M OI, 54M sulforaphane or 0.1% acetonitrile 
(vehicle) for 3h. Cell were washed in PBS and lysed in 200 ul of SDS-lysis buffer 
(50mM Tris-HCl, pH 6.8, 2% (w/v) sodium dodecyl sulfate (SDS) and 10% (v/v) 
glycerol). Lysates were sonicated (20s at 30% amplitude using Vibra-Cell ultrasonic 
processor, Sonic) and boiled (3 min), and dithiothreitol (DTT) and Bromophenol 
blue were added up to 0.1 M and 0.02% (w/v) final concentrations, respectively. 
Proteins (101g) were resolved on a gradient (4—12%) NuPAGE SDS gel, transferred 
onto nitrocellulose membranes, and immunoblotted with anti-KEAP1 (rat mono- 
clonal, Merk Millipore, clone 144), anti-Nrf2 (rabbit monoclonal, CST), and anti- 
6-actin (mouse monoclonal, Sigma) antibodies. Horseradish peroxidase (HRP)- or 
IRDye-labelled secondary antibodies were used interchangeably, followed by either 
ECL detection or scanning using Odyssey imager (Li-COR). 
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ELISA. Cytokine concentrations in cell supernatants were measured using ELISA 
Duoset kits for mouse IL-10 and TNF and human IFN-8 and IL-18, according to 
the manufacturer’s instructions. Cytokine concentrations in serum samples iso- 
lated from whole blood were measured using Quantikine ELISA kits for mouse or 
human IL-18, IFN-8, IL-10 and TNE. Duoset and Quantikine kits were from R&D 
Systems. Optical density values were measured at a wavelength of 450 nm, using a 
FLUOstar Optima plate reader (BMG Labtech). Concentrations were calculated 
using a four-parameter fit curve. 

FACS analysis of ROS. BMDMs were seeded at 0.5 x 10° cells per ml and treated 
as normal. Then 2h before staining, 100% ethanol was added to the dead cell con- 
trol well. Thirty minutes before the end of the stimulation, CellROX (51M) was 
added directly into the cell culture medium. Supernatants of cells that were to be 
stained with Aqua Live/Dead were removed, and an Aqua Live/Dead dilution (1 ml; 
1:1,000 in PBS) was added to each well. Cells were incubated in tinfoil at 37°C 
for 30 min. Cells were washed with PSB, scraped in PBS (0.5 ml), and transferred 
to polypropylene FACS tubes. Samples were analysed using a DAKO CyAn flow 
cytometer, and data was analysed using FlowJo software. MFI was quantified as a 
measure of cellular ROS production. 

Nitric oxide assay. Nitric oxide concentrations in cell supernatants were measured 
using Greiss reagent assay kit from Thermo Fischer Scientific according to the 
manufacturer's instructions. Optical density values were measured at a wavelength 
of 548 nm, using a SoftMax Pro plate reader. Concentrations were calculated using 
a linear standard curve. 

GSH/GSSG measurements. BMDMs were plated at 0.1 x 10° cells per ml in 
opaque 96-well plates. Cells were pre-treated with OI (125 1M) for 2h and then 
stimulated with hydrogen peroxide (100|1M) for 24h. After 24h, cell media was 
removed and the reduced glutathione to oxidized glutathione (GSH/GSSG) ratio 
was quantified using MyBio GSH/GSSG-Glo Assay (V6611) as per manufacturer's 
instructions. Luminescence was quantified using a FLUOstar Optima plate reader. 
LDH assay. Cells were plated at 0.5 x 10° cells per ml in white 24-well plates (500 j1l 
per well) and treated as required. Cytotoxicity, as determined by LDH release, 
was assayed using CytoTox96 Non-radioactive Cytotoxicity Assay kit (Promega) 
according to the manufacturer’s instructions. 

Seahorse analysis of lactate production. Cells were plated at 0.2 x 10° cells per 
well of a 24-well Seahorse plate. Cells were treated and stimulated as normal. 
A utility plate containing calibrant solution (1 ml per well) was placed in a CO - 
free incubator at 37°C overnight. The next day, media was removed from cells 
and replaced with glucose-supplemented XF assay buffer (500 1l per well) was 
placed in a CO>-free incubator for at least 0.5h. Compounds (glucose, oligomycin 
and 2-deoxy-p-glucose (2DG); 7011) were added to the appropriate port of the 
injector plate. This plate together with the utility plate was run on the Seahorse 
for calibration. Once complete, the utility plate was replaced with the cell culture 
plate and run on the Seahorse XF-24. 

Endotoxin-induced model of sepsis. For cytokine measurements, mice were 
treated intraperitoneally with OI (50 mg kg” !) in 40% cyclodextrin in PBS or 
vehicle control for 2h before stimulation with LPS (Sigma; 2.5 mg kg”) intra- 
peritoneally for 2h. Mice were euthanized in a CO chamber, blood samples were 
collected and serum was isolated. Cytokines were measured using R&D ELISA kits 
according to manufacturer’s protocol. For temperature recording, mice (n= 10 per 
group) were treated intraperitoneally with OI (50mg kg ') in 40% cyclodextrin 
in PBS or vehicle control for 2h before stimulation with LPS (5mg kg!) and 
monitored for temperature at 1, 2, 3, 4, 6, 12, 18 and 24h after LPS treatment. 
Temperature was monitored using subcutaneously implanted temperature tran- 
sponder chips (Bio Medic Data Systems; IPTT 300) which were injected between 
the shoulder blades 48 h before experiment. At defined times, body temperature 
was measured by scanning the transponder with a corresponding BMDS Smart 
Probe. Animals were additionally monitored for clinical signs of endotoxic shock, 
based on temperature change, body condition, physical condition and unprovoked 
behaviour, with a combined score of 9 indicating the humane end point for the 
experiment. 

siRNA transfection of BMDMs. Cells were plated at 1 x 10° cells per ml in 
12-well plates overnight. On the day of transfection, the media was replaced with 
500 ul DMEM without penicillin/streptomycin or FBS. For each target gene, two 
Eppendorfs were prepared. Optimem (250 1 per well) was added to each tube. 
RNAimax (add 511 per well) was added to one set of tubes and short interfering 
siRNA (siRNA; 50nM per well) was added to the second set of tubes. The tube con- 
taining the siRNA was added to the tube with RNAimax, mixed well by pipetting 
and incubated for 15 min. The mix (50011) was added to each well. Twenty-four 
hours after transfection, cells were treated as required. 

Analysis of KEAP1 modification by OI. Human embryonic kidney cells 
(HEK293T cells) were transfected with a pCMV6-KEAP1 vector (Myc-DDK- 
tagged mouse KEAP1) (OriGene). C2C12, Hepalcic7 cells and COS1 cells 
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were from American Type Culture Collection (ATCC). The L929 cells are from 
Sigma (85011425). HEK293T cells were obtained from the Centre for Applied 
Microbiology and Research. Cell lines have not been tested for mycoplasma 
contamination. Twenty-four hours after transfection, cells were treated with OI 
(500 1M) or vehicle control (PBS) for 4h. Tagged KEAP1 was immunoprecipitated 
using an anti-Flag antibody (Sigma) and protein A/G beads (Santa Cruz). After 
immunoprecipitation, bound KEAP1 was eluted off the beads using Flag pep- 
tide (50011; 200 pg ml!) (Sigma) diluted in 1x TBS pH 7.4. The samples were 
then concentrated and the Flag peptide was removed using 10K centrifugation 
filter columns (Merck). The concentrated samples were then divided in half for 
downstream processing. One-half of each sample was diluted 1:2 with 5x SDS 
sample buffer and separated using SDS-PAGE (Bio-Rad). Overexpressed KEAP1 
was detected using Coomassie blue staining and the corresponding bands were 
excised from the gel and subjected to in-gel digest as described. In brief, the gel 
slices were cut into smaller pieces (1-2 mm’) before reduction with DTT (10 mM) 
and alkylation with iodoacetamide (50 mM). Half of the gel slices from each sample 
were then subjected to a trypsin (21g) digest, the other half were digested with 
elastase (1 1g) overnight at 37 °C. Similarly, the remaining sample concentrates 
(in solution) were reduced with DTT and alkylated with iodoacetamide, before 
precipitation of the protein via the methanol-chloroform extraction method. The 
protein pellet was re-suspended in urea (6 M), which was then diluted to <1M urea 
with ultrapure H,O. The samples were then digested with trypsin (21g) overnight 
at 37°C. Digested protein samples were analysed in an Orbitrap Fusion Lumos 
coupled to a UPLC ultimate 3000 RSLCnano System (both Thermo Fisher). For 
further details, see Supplementary Methods. 

Assessment of cysteine alkylation by itaconate using lodo-TMT. After treatment, 
cells were lysed in HEPES pH 7.5, EDTA, glycerol and NP40. 2mM TCEP and 
50mM NEM were added in a buffer containing 50mM HEPES, 2% SDS, 125mM 
NaCl, pH 7.2, and samples were incubated for 60 min at 37°C in the dark to reduce 
and alkylate all unmodified protein cysteine residues. 20% (v/v) TCA was added 
to stabilize thiols and incubated overnight at 4°C and then pelleted for 10 min at 
4,000g at 4°C. The pellet was washed three times with cold methanol (2 ml) and 
then resuspended in 2 ml 8 M urea containing 50 mM HEPES, pH 8.5. Protein 
concentrations were measured by BCA assay (Thermo Scientific) before protease 
digestion. Protein lysates were diluted to 4M urea and digested with LysC (Wako) 
in a 1:100 enzyme:protein ratio and trypsin (Promega) at a final 1:200 enzyme:pro- 
tein ratio for 4h at 37°C. Protein extracts were diluted further to a 2.0 M urea 
and LysC (Wako) at 1:100 enzyme:protein ratio and trypsin (Promega) at a final 
1:200 enzyme:protein ratio were added again and incubated overnight at 37°C. 
Protein extracts were diluted further to a 1.0 M urea concentration, and trypsin 
(Promega) was added to a final 1:200 enzyme:protein ratio for 6h at 37°C. Digests 
were acidified with 250 1l of 25% acetic acid to a pH value of ~2, and subjected to 
C18 solid-phase extraction (50 mg Sep-Pak, Waters). Excess TMT label (6-7 M) 
was added to each digest for 30 min at room temperature (repeated twice). The 
reaction was quenched using 411 5% hydroxylamine. Samples were subjected to an 
additional C18 solid-phase extraction (50 mg Sep-Pak). For LC-MS/MS param- 
eters, data processing and MS2 spectra assignment, TMT reporter ion intensities 
and quantitative data analysis, see Supplementary Methods. 

Reagents. For a complete list of reagents, see Supplementary Methods. 

Mouse strains. Wild-type C57BL/6 mice were from Harlan UK and Harlan 
Netherlands. Animals were maintained under specific pathogen-free conditions 
in line with Irish and European Union regulations. Experiments were approved 
by local ethical review and were carried out under the authority of Ireland’s pro- 
ject license. All animal studies performed in GSK were ethically reviewed and 
carried out in accordance with Animals (Scientific Procedures) Act 1986 and the 
GSK Policy on the Care, Welfare and Treatment of Animals. Nrf2-deficient mice 
and their wild-type counterparts, both on the C57BL/6 genetic background (used 
for isolation of BMDM cells), were bred and maintained in the Medical School 
Resource Unit of the University of Dundee. 

Statistical analysis. Data were expressed as mean + s.e.m. and P values were 
calculated using two-tailed Student's t-test for pairwise comparison of variables, 
one-way ANOVA for multiple comparison of variables, and two-way ANOVA 
involving two independent variables. A Sidak’s multiple comparisons test was used. 
A confidence interval of 95% was used for all statistical tests. Sample sizes were 
determined on the basis of previous experiments using similar methodologies. For 
all experiments, all stated replicates are biological replicates. For in vivo studies, 
mice were randomly assigned to treatment groups. For mass spectrometry analy- 
ses, samples were processed in random order and experimenters were blinded to 
experimental conditions. 

Data availability. Full scans for all western blots are provided in Supplementary 
Fig 1. Source Data for all mouse experiments have been provided. All other data 
are available from the corresponding author on reasonable request. 
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Extended Data Figure 1 | The effect of itaconate on complex II c, d, Complex II and III activity in bovine heart mitochondrial membranes 


activity. a, Complex II and III activity in bovine heart mitochondrial incubated with itaconate (IA; 1 mM unless indicated), with subsequent 
membranes incubated with succinate plus malonate or itaconate (n =3 removal and addition of succinate (1 mM; n = 3 independent experiments) 
independent experiments). b, Effect of malonate or itaconate on the (see Methods for further details). Data are mean + s.e.m. P values 


oxygen consumption rate (OCR) of rat liver mitochondria in the presence calculated using one or two-way ANOVA. 
of succinate (1 mM) and FCCP (1,.M; n = 3 independent experiments). 
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Extended Data Figure 2 | DMI activates Nrf2 and limits cytokine (n=5). e, f, LPS (24h)-induced J/1b mRNA (e), IL-13 and HIF-1a protein 
production. a, c, LPS (100 ng ml~!)-induced Nrf2 (a, 24h) and HMOX1 (f) expression in mouse macrophages with or without DMI (n= 3). Data 
(c, 6h) protein expression with or without the itaconate derivative DMI. are mean + s.e.m. P values calculated using one-way ANOVA. Blots are 
b, Nrf2-dependent mRNA expression after treatment with LPS representative of three independent experiments. For gel source data, see 
(6h) and DMI where indicated (n = 3). d, Reduced glutathione (GSH) Supplementary Fig. 1. 


and oxidized glutathione (GSSG) levels after treatment with LPS and DMI 
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Extended Data Figure 3 | O] is the best tool to assess itaconate- 
dependent Nrf2 activity. a, Reactivity of DMI, itaconate and OI 

with thiols. b, c, Itaconate ester reactivity with GSH and glutathione- 
S-transferase (GST) as detailed in the Methods (n = 3). d, Itaconate levels 
in mouse C2C12 cells plus itaconate esters (nm = 3). MI, 4-methy] itaconate. 
e, i, Itaconate (e) or GSH (i) levels plus LPS (6h) and Ol] as indicated 
(n=5). f, NQO1 activity in mouse Hepalclc7 cells treated with DMI or 
OI (48h) and GSH (n= 8). g, h, Metabolic intermediates in GSH synthesis 
(h, average of five biological replicates). i, GSH levels after treatment 

with LPS (6h) and/or OI (n=5). j, GSH/GSSG ratio after treatment 


with OI (2h) and HO; (100M, 24h; n = 3) as indicated. k, HMOX1 
protein levels after treatment with OI and/or H2O> (24h). 1, Nrf2, 
HMOxX1 and IL-1 protein levels in BMDMs pre-treated with OI, 4-octyl 
2-methylsuccinate (OMS) or octyl succinate (OS), all 125 41M for 3 h with 
or without LPS (6h). m, LPS-induced Nrf2 (24h) and HMOX1 (6h) 
protein expression with or without dimethyl malonate (DMM). Data are 
mean + s.e.m. P values calculated using one- or two-way ANOVA. Blots 
are representative of three independent experiments. For gel source data, 
see Supplementary Fig. 1. 
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Extended Data Figure 4 | Itaconate is transported by the mitochondrial [?H]itaconate (141M). b, Initial transport rates of each carrier with either 


oxoglutarate, dicarboxylate and citrate carriers. a, Itaconate uptake into canonical substrate (homo-exchange) or canonical substrate/itaconate 
vesicles of Lactococcus lactis membranes expressing the indicated carriers (hetero-exchange). n = 4 independent experiments; data are mean + s.d. 
loaded with itaconate (1 mM), and transport initiated by the addition of P values calculated using two-tailed Student's t-test. 
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Extended Data Figure 5 | KEAP1 is alkylated by OI on major redox alkylation after treatment with LPS (e, 24h) or OI (f, 250M, 4h) (n=4). 
sensing cysteine residues. a, Modification of cysteine by fumarate or Detected N- and C-terminal fragment ions of both peptides are assigned 
itaconate. Tandem mass spectrometry spectrum of KEAP1 Cys257 in the spectrum and depicted as follows: b: N-terminal fragment ion; 
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carbamidomethylated (CAM) peptides (right). e, f, LDHA Cys84 Representative of one independent experiment. 
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Extended Data Figure 6 | Identification of an itaconate-cysteine adduct. 
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tracking itaconate-cysteine adduct formation in BMDMs treated with 
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Extended Data Figure 7 | OI decreases LPS-induced cytokine 
production, extracellular acidification rate, ROS and nitric oxide. 

a, Percentage cytotoxicity (LDH release) in BMDMs after treatment 

with LPS and OI as indicated (n = 3). b, LPS-induced extracellular 
acidification rate (ECAR) after treatment with OI and/or LPS as indicated, 
analysed on the Seahorse XF-24 in BMDMs (trace representative of three 
independent experiments). c, d, LPS-induced I/10 mRNA (c, 4h) and 
protein (d, 24h) and TNF protein (f; n = 7) after OI treatment as indicated 
(n= 3). e, Phosphorylated p65 (pp65) protein levels (a measure of NF-kB 
activity) after treatment with LPS and OJ as indicated. h, Representative 
gating strategy for FACS analysis of ROS production in cells as treated 


in d (image representative of three independent experiments). i, LPS- 
induced NOS2 expression (n= 6), with or without OI treatment. j, LPS- 
induced TNF (n= 4) and IL-18 (n=3) protein levels after OI treatment 
in PBMCs. k, Nrf2 and HMOX!1 protein levels or Nrf2-dependent gene 
expression (1 =5) in peritoneal macrophages from mice (m) injected 
intraperitoneally with OI (50 mg kg~!, 6h) or vehicle control. 1, Serum 
IL-10 from mice injected intraperitoneally with vehicle control or OI 
(50 mg kg~!, 2h) and LPS (2.5 mg kg™!, 2h, n=3 vehicle, Ol; n= 15 LPS, 
OI plus LPS). Data are mean + s.e.m. P values calculated using one-way 
ANOVA. Blots are representative of three independent experiments. For 
gel source data, see Supplementary Fig. 1. 
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Extended Data Figure 8 | The effects of OI on cytokine production are g-k, I/1b (g) and Nos2 (j) mRNA, and IL-1 (h), IL-10 (i), TNF and 
Nrf2-dependent. a-e, Nrf2, HMOX1 and IL-1{ protein levels (a, ¢, d) nitrite (k) production with or without LPS (24h), diethyl maleate (DEM; 
and [/1b mRNA expression (b, e) in mouse BMDMs transfected with two 100 1M) or 15-deoxy-A12,14-prostaglandin J2 (J2; 51M) pre-treatment 
different Nrf2 siRNAs (50 nM) compared with a non-silencing scrambled for 3h (n=3). Data are mean +s.e.m. P values calculated using one-way 
control siRNA plus LPS (6h; a-c, e; 24h; d) and/or OI (n =6). NT, non- ANOVA. Blots are representative of three independent experiments. For 
transfected. f, I/1b mRNA expression in wild-type and Nrf2-knockout gel source data, see Supplementary Fig. 1. 


BMDMs treated with LPS (24h; WT n= 2, Nrf2 KO n=4) and/or OI. 
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Extended Data Table 1 | Mass spectrometry analysis of itaconate-induced cysteine alkylation 


a 
4-Ol Peptide Peptide Sequence -10logP| Enzyme | Digest 
Residue | Amino Acid Type 

Position 
Cys151 144-152 A.SISVGEKC(+242.15)V.L Elastase | In Gel 
146-152 I. SVGEKC(+242.15)V.L 
144-153 A.SISVGEKC(+242.15)VL.H 
145-152 S.ISVGEKC(+242.15)V.L 
Cys297 _ | 294-304 A.DARC(+242.15)KDYLVQI.F 37.15 
b 
. Alkylated |Peptide amino| Peptide sequence 
Cys385 378-395 VGLIGSC(+4.98)TNSSYEDMGR 4.212 
Cys189 186-204 GDRC(4.98)QDLSVNQDLADTDAR 3.514 
Ipyr2 Cys156, 157] 153-171 STDC(4.98)C(+4.98)GDNDPIDVCEIGSK 4.664 22.60 
Cys73 73-81 C(+4.98)MPTFQFYK 2.422 
c 


‘ Alkylated |Peptide amino} Peptide sequence 
: X scor 
Preis | residue acid poston aera i 


a, Cysteine/lysine residue(s) in KEAP1 modified by Ol as determined by tandem mass spectrometry. b, Cysteine residues modified by itaconate 
in BMDMs treated with LPS identified using tandem mass spectrometry. ¢, Cysteine residues modified by itaconate in BMDMs treated with OI 
identified using tandem mass spectrometry. 
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Structure of the peptidoglycan polymerase RodA 
resolved by evolutionary coupling analysis 


Megan Sjodt!, Kelly Brock”, Genevieve Dobihal’, Patricia D. A. Rohs’, Anna G. Green”, Thomas A. Hopf?, Alexander J. Meeske?, 
Veerasak Srisuknimit*, Daniel Kahne*, Suzanne Walker’, Debora S. Marks2, Thomas G. Bernhardt?, David Z. Rudner? & 


Andrew C. Kruse! 


The shape, elongation, division and sporulation (SEDS) proteins are 
a large family of ubiquitous and essential transmembrane enzymes 
with critical roles in bacterial cell wall biology. The exact function 
of SEDS proteins was for a long time poorly understood, but recent 
work! has revealed that the prototypical SEDS family member 
RodA is a peptidoglycan polymerase—a role previously attributed 
exclusively to members of the penicillin-binding protein family’. 
This discovery has made RodA and other SEDS proteins promising 
targets for the development of next-generation antibiotics. However, 
little is known regarding the molecular basis of SEDS activity, and 
no structural data are available for RodA or any homologue thereof. 
Here we report the crystal structure of Thermus thermophilus 
RodA at a resolution of 2.9 A, determined using evolutionary 
covariance-based fold prediction to enable molecular replacement. 
The structure reveals a ten-pass transmembrane fold with large 
extracellular loops, one of which is partially disordered. The protein 
contains a highly conserved cavity in the transmembrane domain, 
reminiscent of ligand-binding sites in transmembrane receptors. 
Mutagenesis experiments in Bacillus subtilis and Escherichia coli 
show that perturbation of this cavity abolishes RodA function 
both in vitro and in vivo, indicating that this cavity is catalytically 
essential. These results provide a framework for understanding 
bacterial cell wall synthesis and SEDS protein function. 

The synthesis of a cell wall and maintenance of its integrity are essen- 
tial processes for virtually all Eubacteria, and the targeted disruption 
of cell wall biogenesis is among the most effective of therapeutic strat- 
egies in the treatment of bacterial infections. A central step in cell wall 
synthesis is the concatenation of a lipid II disaccharide pentapeptide 
headgroup onto a peptidoglycan chain through a glycosyl transfer 
reaction. This reaction has long been known to be catalysed by the 
glycosyltransferase domains of class A penicillin-binding proteins* 
(aPBPs). However, deletion of all aPBPs is tolerated in B. subtilis? 
and bacteria that lack aPBPs are able to synthesize peptidoglycan®, 
which implies the existence of non-aPBP glycosyltransferases. This 
was recently resolved with the discovery that the highly conserved 
SEDS membrane proteins comprise a second class of peptidoglycan 
polymerases! (Fig. 1a). In fact, SEDS proteins are even more widely 
distributed than aPBPs!’. Despite their importance and broad phylo- 
genetic distribution, however, SEDS protein function is not well under- 
stood and no SEDS protein has yet been characterized structurally. 

To better understand SEDS protein function, we pursued structural 
studies of wild-type T. thermophilus RodA as well as a catalytically inac- 
tive RodA mutant (D255A). The proteins were expressed, purified and 
then crystallized using the lipidic cubic phase method. X-ray diffraction 
datasets were collected to a resolution of 2.9 A and 3.2 A for wild-type 
and D255A mutant proteins, respectively. Because RodA has no homo- 
logues of known structure, phase calculation by molecular replacement 
was impossible. A wide variety of heavy-atom phasing approaches were 


attempted without success. Having exhausted conventional methods, 
we sought to develop an approach that requires neither experimental 
phase data nor the structure of a homologous protein. 

Recent methodological advances in molecular replacement 
have expanded the range of suitable templates®, and evolutionary 
co-variation analysis now allows for fold prediction even in the absence 
of prior structural data?"''. This approach exploits the fact that residues 
that interact with one another structurally tend to co-evolve to maintain 
their interactions. The analysis of many sequences enables inference 
of spatial interactions between pairs of residues, providing restraints 
sufficient to define major features of protein structure. Analysis of 
RodA by this method showed extensive co-variation throughout the 
protein (Fig. 1b, c), and we reasoned that using evolutionary coupling 
restraints to build RodA models might provide suitable templates for 
molecular replacement phasing, an approach we call ‘evolutionary 
coupling-enabled molecular replacement’ (EC-MR). 

In brief, our EC-MR approach consisted of the parallel construction 
and sampling of many independent evolutionary coupling-derived 
models of RodA to identify suitable templates, followed by phase 
calculation and model building. First, 100 models of RodA were con- 
structed on the basis of evolutionary restraints. These were each tested 
as single templates for molecular replacement in Phaser’. Of these, 22 
models yielded a cluster of solutions that were high-scoring and similar 
to one another (Fig. 1d). An ensemble search model was constructed 
from a subset of these, producing maps that were suitable for manual 
rebuilding followed by ROSETTA refinement in Phenix’. The final 
refined structure showed normal crystallographic statistics (Extended 
Data Fig. 1, Extended Data Table 1). While previous work has estab- 
lished that structural models constructed ab initio can in principle be 
suitable for molecular replacement phasing in select cases'*"», these 
analyses were conducted on very short proteins (<100 amino acids) 
with high-resolution structural data (<2.1 A). Other work has shown 
that at very high resolution, structures can be solved using even a single 
atom as a search model!®. At lower resolution, symmetric «-helical 
proteins can be phased from helical fragments, although this relies on 
symmetry conditions that are met in only a small minority of cases!”. 
Determination of the structure of RodA by EC-MR establishes that 
evolutionary covariance-derived models can be suitable for phase 
determination of even a large (359 amino acid) asymmetric protein 
at modest resolution. 

The structures of wild-type RodA and the D255A mutant are virtu- 
ally identical (Extended Data Fig. 2), and we focus here on the higher- 
resolution wild-type RodA structure. The overall structure shows 
ten well-resolved transmembrane helices (TM1-TM10) connected 
by loops, most of which are well ordered (Fig. 2). Searches for pro- 
teins of similar fold with the DALI server’ yielded no hits, indicating 
that RodA possesses a unique overall fold. The transmembrane heli- 
ces of RodA are largely straight and perpendicular to the membrane 
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Figure 1 | Biological role of RodA and 
evolutionary co-variation fold prediction. 
4] a, RodA is a peptidoglycan polymerase. 
b, Evolutionary co-variation map showing 476 
co-evolved residues in RodA. c, Evolutionary 
couplings (green lines) were used to generate 
models of RodA. Ten representative models 
are shown. d, Molecular replacement (MR) 
solutions obtained from 100 RodA models. 
As a reference point, the C, atom of Glu108 is 
shown as a sphere coloured according to log- 
likelihood gain (LLG) score for each solution. 
The corresponding position for the final refined 
RodA structure is shown as a crosshair. The 
refined structure (green) and the C, atom of 
Glu108 (magenta sphere) is overlaid with the 
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plane, with the exception of TM3 that runs diagonally through the 
membrane with a 45° kink at Pro71. Although some previous studies 
have suggested that the SEDS protein FtsW could function as a lipid 
II flippase’®, RodA lacks a transmembrane channel suitable for lipid II 
transport and bears no notable structural similarity to transporters or 
flippases. 

The intracellular loops of RodA are structured but are very short, 
and little polar surface area is exposed on the intracellular face of the 
enzyme. By contrast, extracellular loops (ECLs) 2, 4and 5 are large and 
contain many functionally essential residues’, consistent with cata- 
lytic activity occurring at the extracytoplasmic face of the membrane. 
ECL2 includes a highly conserved B-hairpin, capped by Gly100 and 
Pro101. ECL4 is even larger at 80 amino acids in length, but is not 
resolved from residues 189 to 227 as well as from 237 to 251. These 
regions include essential residues as well as high-ranking evolutionary 
couplings between ECL4 and ECL1, ECL2 and ECLS5, which suggests 
that they have functionally important roles despite not being resolved 
in the current structure (Extended Data Figs 3, 4). It is possible that 
these regions become ordered only upon substrate binding, or in com- 
plex with a peptidoglycan crosslinking enzyme. Unlike other ECLs, 
ECLS5 is not exposed to the surface and instead is buried within the 
protein core. 

Between TM2 and TM3 is a long hydrophobic groove containing 
electron density suggestive of a bound lipid molecule, which we ten- 
tatively modelled as monoolein owing to its high concentration in the 
crystallization conditions (Fig. 2b, c, Extended Data Figs 2, 5). This 
groove is adjacent to a collection of highly conserved residues (Fig. 2c), 
and may represent the binding site for the lipid-anchored substrates of 
RodA. Adjacent to this groove is a large water-filled cavity open to the 
extracellular face of the protein, flanked by Glu108, Met306, Leu307, 
Gln310 and Thr342 (Fig. 2d). On the edge of this cavity, Glul108 and 


Cc 
47.8A 


MR solutions in cluster 
Refined structure 


Viewed from 
extracytoplasmic side 


Lys111 form an absolutely conserved salt bridge (Fig. 2e). Mapping the 
results of previous high-throughput mutagenesis in B. subtilis RodA 
onto the structure shows that the salt bridge and other residues in the 
central cavity are intolerant of substitution’ (Fig. 3a). 

To investigate the function of this central cavity in more detail, we 
turned to site-directed mutagenesis followed by phenotypic characteri- 
zation in the representative model Gram-negative and Gram-positive 
organisms E. coli and B. subtilis. First, we assessed whether the con- 
served salt bridge between Glu108 and Lys11 lis essential, by moni- 
toring the effect of the mutant enzyme when expressed in a wild-type 
background. Consistent with previous results, mutation of either of 
these residues to alanine resulted in a dominant-negative phenotype 
characteristic of Rod complex dysfunction, in which cells lose their 
elongated morphology and become enlarged and spherical before lysis 
(Fig. 3b). This suggests that the mutant proteins are properly folded and 
able to interact with other members of the Rod complex, but are unable 
to promote cell elongation. 

To ascertain whether the salt bridge is catalytic or merely important 
for proper folding, we constructed a salt-bridge swap mutant (that is, an 
E108K and K111E double mutant). If the role of the salt bridge is purely 
structural, this swap should have a minimal effect on RodA function, 
because the salt bridge is maintained. However, if Glu108 has a role 
in catalysis, then swapping the two amino acids would be expected to 
abrogate function. Indeed, the mutant protein folds properly as meas- 
ured by circular dichroism spectroscopy (Extended Data Fig. 6), but 
in cell assays the mutant protein shows a strong dominant-negative 
phenotype (Fig. 3b, c, Extended Data Fig. 7). Moreover, the mutant 
enzyme has no detectable peptidoglycan polymerization activity 
in vitro, confirming that its toxicity derives from a lack of catalytic 
activity (Fig. 3d, Supplementary Fig. 1). Other residues near the central 
cavity were similarly essential for RodA function, including Asp255 
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Figure 2 | Structure of RodA. a, Structure of RodA viewed parallel to 
the membrane plane (left) and from the extracytoplasmic side (right). 

b, Surface view of RodA showing electrostatic potential and the position 
of a bound lipid between TM2 and TM3 (orientation is identical to that 
in a. c, Close-up view of bound lipid (yellow). RodA surface is coloured 
by sequence conservation. d, In the centre of RodA there is a water-filled 
cavity open to the extracytoplasmic surface. e, The water-filled cavity 

of RodA is flanked by highly conserved polar residues and a salt bridge 
(yellow dashed line) between Glu108 and Lys111. 


as previously reported!, as well as Asp152 (Fig. 3b, Extended Data 
Fig. 7). Analysis of evolutionary co-variation data using EVmutation”” 
likewise predicts these residues, and the salt bridge, to be immutable. 
Taken together, the high degree of sequence conservation, intolerance 
to mutation and catalytic essentiality of residues surrounding the 
central cavity confirm that this portion of the protein has a critical 
role in peptidoglycan polymerization, which makes it a prime target 
for the development of antibiotics. 

The glycan-strand polymerization process catalysed by RodA and 
aPBPs is essential but not sufficient to build a cell wall. A second 
key step is peptide crosslinking, catalysed by the penicillin-binding 
domains of aPBPs and class B penicillin-binding proteins (bPBPs). 
Cytological and protein-protein interaction studies indicate that SEDS 
proteins and bPBPs are likely to form a complex in cells'’, and evo- 
lutionary coupling analysis shows strong co-variation between bPBP 
and SEDS protein sequences. This is sufficient to map the binding site 
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Figure 3 | The central cavity is essential for RodA function. a, Homology 
model of B. subtilis RodA with each residue coloured according to its 
tolerance for mutation (see Methods). Residues are numbered according to 
B. subtilis, with corresponding T. thermophilus numbering in parentheses. 
b, Wild-type E. coli cells harbouring a plasmid with indicated RodA 
mutants. Scale bar, 541m. WT, wild type. c, Expression of RodA charge- 
swap mutant is toxic in B. subtilis. Intracellular mCherry expression 
indicates cytosol. For both b and c, images are representative of three 
independent experiments. IPTG, isopropyl-$-p-thiogalactoside. 

d, Catalytic competence of B. subtilis RodA mutants, representative of two 
independent experiments. Asterisk indicates the PBP4 labelling enzyme 
used for detection; StgB, SgtB(Y181D) used as control. 


between bPBPs and RodA to TM8 and TM9 (Fig. 4a, b, Extended Data 
Fig. 8), corresponding to the proposed interaction site between the 
divisome SEDS protein FtsW and its corresponding bPBP, FtsI!°7!. 
RodA mutants in this interface exhibit a dominant-negative effect in 
E. coli (Fig. 4c). However, mutation of this site does not prevent RodA- 
mediated peptidoglycan polymerization in vitro (Extended Data Fig. 7), 
confirming the functional importance of coordinated glycan strand 
elongation and peptide crosslinking in cells. 

Complexes between SEDS proteins and bPBPs contain both 
glycan-strand polymerization and peptide crosslinking active sites, 
recapitulating the dual catalytic activities found in aPBPs (Fig. 4d). 
Understanding how SEDS proteins coordinate their activity with 
that of bPBPs to build a cell wall will be an important area for future 
investigation. The structure of RodA now provides a foundation for 
such work, serving as a framework for understanding the function of 
SEDS proteins. 
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Figure 4 | Interaction between RodA 

and its class B pencillin-binding protein, 
PBP2. a, Evolutionary co-variation map 
showing 19 evolutionary couplings between 
RodA and PBP2. b, Representation of 
evolutionary co-variation. c, Mutations in 
RodA-bPBP interface result in morphological 
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Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 


Received 15 October 2017; accepted 8 February 2018. 
Published online 28 March 2018. 


1. Meeske, A. J. et al. SEDS proteins are a widespread family of bacterial cell wall 
polymerases. Nature 537, 634-638 (2016). 

2. Cho, H. et a/. Bacterial cell wall biogenesis is mediated by SEDS and PBP 
polymerase families functioning semi-autonomously. Nat. Microbiol. 1, 16172 
(2016). 


3. Emami, K. et al. RodA as the missing glycosyltransferase in Bacillus subtilis and 
antibiotic discovery for the peptidoglycan polymerase pathway. Nat. Microbiol. 
2, 16253 (2017). 

4. Goffin, C. & Ghuysen, J.-M. Multimodular penicillin-binding proteins: an 


enigmatic family of orthologs and paralogs. Microbiol. Mol. Biol. Rev. 62, 
1079-1093 (1998). 
5. McPherson, D. C. & Popham, D. L. Peptidoglycan synthesis in the absence of 
class A penicillin-binding proteins in Bacillus subtilis. J. Bacteriol. 185, 
1423-1431 (2003). 
Packiam, M., Weinrick, B., Jacobs, W. R., Jr & Maurelli, A. T. Structural 
characterization of muropeptides from Chlamydia trachomatis peptidoglycan 
by mass spectrometry resolves ‘chlamydial anomaly’. Proc. Natl Acad. Sci. USA 
112, 11660-11665 (2015). 
7. Otten, C., Brilli, M., Vollmer, W., Viollier, P. H. & Salje, J. Peptidoglycan in obligate 
intracellular bacteria. Mol. Microbiol. 107, 142-163 (2018). 


8. DiMaio, F. et a/. Improved molecular replacement by density- and energy- 
guided protein structure optimization. Nature 473, 540-543 (2011). 
9. Hopf, T. A. et al. Three-dimensional structures of membrane proteins from 
genomic sequencing. Cel/ 149, 1607-1621 (2012). 
0. Hopf, T. A. et a/. Sequence co-evolution gives 3D contacts and structures of 


protein complexes. eLife 3, e03430 (2014). 
1. Marks, D. S. et al. Protein 3D structure computed from evolutionary sequence 
variation. PLoS ONE 6, e28766 (2011). 
2. McCoy, A. J. et al. Phaser crystallographic software. J. Appl. Crystallogr. 40, 
658-674 (2007). 
. DiMaio, F. et al. Improved low-resolution crystallographic refinement with 
Phenix and Rosetta. Nat. Methods 10, 1102-1104 (2013). 
. Qian, B. et al. High-resolution structure prediction and the crystallographic 
phase problem. Nature 450, 259-264 (2007). 
5. Rigden, D. J., Keegan, R. M. & Winn, M. D. Molecular replacement using 
ab initio polyalanine models generated with ROSETTA. Acta Crystallogr. D 64, 
1288-1291 (2008). 


16. McCoy, A. J. et al. Ab initio solution of macromolecular crystal structures 
without direct methods. Proc. Nat! Acad. Sci. USA 114, 3637-3641 (2017). 
Strop, P., Brzustowicz, M. R. & Brunger, A. T. Ab initio molecular-replacement 
phasing for symmetric helical membrane proteins. Acta Crystallogr. D 63, 
188-196 (2007). 

Holm, L. & Rosenstrom, P. Dali server: conservation mapping in 3D. Nucleic 
Acids Res. 38, W545-W549 (2010). 

Mohammadi, T. et al. Identification of FtsW as a transporter of lipid-linked cell 
wall precursors across the membrane. EMBO J. 30, 1425-1432 (2011). 
Hopf, T. A. et al. Mutation effects predicted from sequence co-variation. 

Nat. Biotechnol. 35, 128-135 (2017). 

Ovchinnikov, S. et a/. Large-scale determination of previously unsolved protein 
structures using evolutionary information. eLife 4, e€09248 (2015). 

Leclercq, S. et al. Interplay between penicillin-binding proteins and SEDS 
proteins promotes bacterial cell wall synthesis. Sci. Rep. 7, 43306 (2017). 


17. 


18. 
19. 
20. 
21. 
22. 


Supplementary Information is available in the online version of the paper. 


Acknowledgements Financial support for the work was provided by NIH grant 
U19A1109764 (A.C.K., D.Z.R., T.G.B., S.W. and D. K.), NIH grant RO1GM106303 
(D.S.M.) and a CIHR doctoral research award to P.D.A.R. We thank Advanced 
Photon Source GM/CA beamline staff for technical support during X-ray data 
collection, K. Arnett (Harvard Center for Macromolecular Interactions) for 
support of circular dichroism experiments and C. Sander for discussions. 


Author Contributions M.S. and AJ.M. performed expression screening 
experiments, and M.S. performed large-scale purification and crystallization of 
RodA as well as enzyme assays and circular dichroism spectroscopy. Additional 
input regarding enzyme assays was provided by P.D.A.R, V.S., D.K. and S.W. The 
structure was solved and refined by M.S. and A.C.K. using evolutionary coupling- 
derived models developed by K.B., A.G.G., T.A.H. and D.S.M. Assessment of RodA 
mutant phenotypes was conducted by G.D. and P.D.A.R. with supervision from 
T.G.B. and D.Z.R. Overall project supervision was performed by A.C.K. with input 
from T.G.B. and D.Z.R. The manuscript was written by M.S. and A.C.K. with input 
from other authors. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing interests. Readers 
are welcome to comment on the online version of the paper. Publisher's 

note: Springer Nature remains neutral with regard to jurisdictional claims in 
published maps and institutional affiliations. Correspondence and requests for 
materials should be addressed to A.C.K. (andrew_kruse@hms.harvard.edu). 


Reviewer Information Nature thanks R. Read, K. Young and the other 
anonymous reviewer(s) for their contribution to the peer review of this work. 


5 APRIL 2018 | VOL 556 | NATURE | 121 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized and investigators were not blinded to allocation during 
experiments and outcome assessment. 

Protein purification. RodA from T. thermophilus was cloned into pAM172 
plasmid! using EcoR1 and Avril restriction enzymes (resulting in plasmids 
pMS211 and pMS224 for wild-type and D255A RodA, respectively). The expres- 
sion plasmids contain an amino-terminal SUMO-fusion followed by a Flag epitope 
tag and a 3C protease cleavage site (SUMO-Flag-3C-RodA and SUMO-Flag-3C- 
RodA(D255A)), and were transformed into E. coli C43 derivative of BL21 (DE3) 
harbouring an arabinose-inducible Ulp1 protease plasmid (pAM174) under the 
selection for both plasmids. Five fresh transformants harbouring both plasmids 
were inoculated into 5 ml LB medium supplemented with 100j1g ml! ampicillin 
and 351g ml“! chloramphenicol and allowed to grow overnight at 37 °C in a rolling 
shaker. The 5-ml overnight culture was then diluted in 1 litre of TB broth supple- 
mented with 0.1% glucose, 2mM MgCh, 100g ml“! ampicillin, and 351g ml! 
chloramphenicol. Cultures were grown at 37°C until an ODg00 of 0.6 and shifted 
to 20°C. At an ODg09 of 0.8, protein expression was induced by addition of IPTG 
(1 mM final) and arabinose (0.2% final) for RodA and Ulp1, respectively. After a 
16-h induction, cells were collected and frozen at —80°C. 

Cells were resuspended in lysis buffer (50 mM HEPES pH 7.5, 150mM NaCl, 
20mM MgCl, 100 jg ml"! lysozyme, 1:100,000 (v/v) benzonase nuclease, and 
2mg ml | iodoacetamide), lysed by sonication and membranes were collected by 
ultracentrifugation at 100,000g for 1h at 4°C. Flag~3C-RodA was then extracted 
using a glass dounce tissue grinder in a solubilization buffer containing 20 mM 
HEPES pH 7.5, 500mM NaCl, 20% (v/v) glycerol, 2mg ml! iodoacetamide, and 
1% (w/v) N-dodecyl 8-p-maltoside (DDM; Anatrace). Samples were stirred for 
2h at 4°C, then centrifuged as above for 1h. The supernatant containing solubi- 
lized RodA was supplemented with 2mM CaCl, and loaded by gravity flow onto 
4ml anti-Flag antibody affinity resin. The resin was washed extensively, first in 
50 ml of buffer containing 20 mM HEPES pH 7.0, 500mM NaCl, 2mM CaCh, 
20% glycerol, 0.1% DDM, and then in 50 ml of the same buffer supplemented with 
10mM adenosine 5’-triphosphate magnesium salt and 20mM KCl to remove the 
bacterial chaperones, GroEL and DnaK. RodA was eluted in 20 mM HEPES pH 
7.0, 500 mM NaCl, 20% glycerol, 0.1% DDM supplemented with 5 mM EDTA and 
0.2 mg ml! Flag peptide. 3C protease was added (1:1,000 w/w) and incubated with 
RodA at 4°C overnight. RodA was further purified by size exclusion chromato- 
graphy (SEC) on a Sephadex $200 column (GE Healthcare) in buffer containing 
20mM HEPES pH 7.5, 500 mM NaCl and 0.1% DDM. After preparative SEC, the 
protein was concentrated to 30-40 mg ml! and flash-frozen with liquid nitrogen 
in aliquots of 8 jl. Samples were stored at —80°C until use for crystallography. 
Purity and monodispersity of crystallographic samples was evaluated by 
SDS-PAGE and analytical SEC, respectively. 

Crystallography and data collection. Purified T: thermophilus wild-type and 
D255A RodA were reconstituted into lipidic cubic phase by mixing with a 10:1 
(w/w) mix of monoolein (Hampton Research) with cholesterol (Sigma Aldrich) 
at a ratio of 1.0:1.5 protein:lipid by mass, using the coupled syringe reconstitu- 
tion method”*. All samples were mixed at least 100 times before dispensing. The 
resulting phase was dispensed in 15-40 nl drops onto a glass plate and overlaid 
with 600 nl of precipitant solution using a Gryphon LCP robot (Art Robbins 
Instruments). Crystals grew in precipitant solution containing 35-50% PEG 200, 
100mM NaCl, 100mM MgCl, and 100 mM Tris pH 7.6-8.2. Initial crystalliza- 
tion hits grew within 24h, with diffraction-quality crystals reaching full size over 
the course of 2-4 weeks. Crystals were collected using mesh loops and stored in 
liquid nitrogen until data collection. Data collection was carried out at Advanced 
Photon Source GM/CA beamiline 23ID-B. An initial grid raster with 80 x 30-j4m 
beam dimensions was performed to locate crystals within the loop. Additional 
fine-tuning rasters were performed using a 10-j1m beam diameter to optimize 
the position of the crystal for data collection. Data were collected using a 10-j1m 
beam and 0.2°-oscillation width per frame at a wavelength of 1.033 A and a fivefold 
attenuation factor. For both wild-type and D255A RodA, a complete dataset was 
obtained from a single crystal. Diffraction data were indexed and processed using 
XDS”*. Both wild-type and D255A RodA crystallized in the C2 space group with 
one molecule in the asymmetric unit, with a solvent content of approximately 60%. 
Generation of ab initio evolutionary coupling-derived models of RodA. Multiple 
sequence alignments (MSA) of the full length T. thermophilus RodA (Uniprot 
ID Q5SIX3) were generated using the iterative hidden Markov model-based 
sequence search tool jackhmmer”® with five iterations. Alignments were built 
using the Uniref100 dataset”® released in April 2017. An MSA was generated for 
nine different bitscores, a sequence inclusion threshold was normalized to length 
and expressed as the number of bits per residue, with values ranging from the 
most inclusive (0.1) to the least inclusive (0.9). The alignment depth was chosen 
to optimize the number of non-redundant sequences with the fewest gaps in the 


alignment as previously described®”, although the alignment choice was robust 
with respect to consistency of predicted evolutionary couplings over a wide range 
of alignment depths. Blindly optimizing the alignment choice resulted in two 
alignments, with the smaller one at 31,505 non-redundant sequences of length-346 
residues (10-355) and an ‘effective number’ of 8,729 sequences after down- 
weighting sequences with more than 80% identity and no more than 30% gaps in 
any columns used for the EC model computation. 

Model folding criteria. For the chosen MSAs, evolutionary couplings were deter- 
mined using a pseudo-likelihood maximization (PLMC)’”~”’. A mixture model 
approach identified the 99% percentile probability of being in contact. We used 
a combination of commonly used prediction methods to determine secondary 
structure predictions (PSIPRED™ and PolyPhobius*!), together with secondary 
structure propensity computed directly from local evolutionary couplings”’. This 
resulted in the identification of ten transmembrane helices and three smaller 
helices in ECL4, and two 8-strands in ECL2. A total of 220 folded models for 
T. thermophilus RodA were generated for increasing numbers of evolutionary 
coupling restraints using the folding protocol in EVfold’, which uses a distance 
geometry and simulated annealing protocol in CNS****. All models were ranked 
as previously described'!* and the 50 top-ranked models from each of the MSAs 
were used as molecular replacement search models (see below). To generate 
additional models used for further ensemble-based phasing runs, five additional 
models were generated using the top-ranked fold prediction from bitscore 0.8 with 
an additional round of dedicated simulated annealing in CNS. Cartesian dynamics 
were used for heating, torsion for cooling and 600 final minimization steps with 50 
cycles were used with other parameters kept as defaults. In addition to evolutionary 
couplings scores, for folded models EVfold generates an evolutionary coupling 
enrichment score for each residue that reflects how constrained that residue is, 
which can be thought of as conservation of ‘coupling. 

Phasing and refinement. One hundred evolutionary coupling-derived models of 
wild-type RodA were primed for molecular replacement using Sculptor*’, which 
included limited side-chain pruning and B-factor assignment based on accessible 
surface area*®. Each model was tested as a single search template for molecular 
replacement in Phaser’, resulting in 98 out of the 100 initial models providing 
candidate solutions. The solutions were sorted by translation function Z-score 
(TF-Z) and the top 32 solutions manually inspected in PMOL”. A total of 22 
solutions were highly similar and distinct from all other solutions. Sorting the 
solutions by log-likelihood gain (LLG) provided a similar result, in which all 22 
solutions were among the top 40% of solutions. However, most of these were not 
well separated from other potential solutions in their respective searches, typically 
with two-to-four other candidate solutions giving LLG values at least 75% of that 
of the top solution. The models giving the top two solutions (as judged by both 
TF-Z and LLG metrics) were used as a single ensemble for phase calculation 
followed by manual building using COOT* and reciprocal space refinement using 
phenix.refine*’. Manual refinement was complemented by the use of ROSETTA 
refinement in Phenix)’. Subsequently, additional cycles of manual building and 
reciprocal space refinement led to the final refined structure. 

Verification of sequence register was straightforward and unambiguous owing 
to the relatively high resolution and frequency of bulky amino acid side chains 
(RodA is roughly 10% tryptophan, tyrosine and phenylalanine). Representative 
composite omit map density is shown in Extended Data Fig. 1. The structure of 
RodA(D255A) was solved using wild-type RodA as the molecular replacement 
search model and the resulting refined structure is nearly identical to that of wild- 
type RodA (0.1 Arm.s.d. between all C, atoms). After refinement, the quality of 
both structures was assessed using MolProbity to calculate Ramachandran statistics 
and other parameters”, and figures were prepared in PyMOL. All crystallographic 
data processing, refinement and analysis software was compiled and supported by 
the SBGrid Consortium"). 

Predicting residue contacts between RodA and PBP2. EVComplex"” was used 
to predict inter-protein contacts between T. thermophilus full-length RodA and 
full-length PBP2 (Uniprot ID Q5SJ23). We constructed alignments for RodA 
(33,670 sequences) and PBP2 (40,764 sequences) as described above for RodA 
alone, using the April 2017 Uniprot release” for clarity on species identifiers. We 
concatenated RodA and PBP2A sequences from each species when they were 
within 10,000 nucleotides of each other, based on European Nucleotide Archive 
data downloaded in February 2017", and evolutionary couplings were computed 
on the complex alignment as previously described'’. A mixture model approach 
was used to identify top-scoring contacts, both within individual monomers and 
between RodA and PBP2, and to assign a probability to each contact of being 
within the tail of the distribution of coupling scores as previously described”’. 
This scoring method is accurate for predicting whether proteins interact as well 
as which residues are in contact!°. The distribution of evolutionary couplings is 
approximated by a Gaussian log-normal mixture model, and we defined the tail 
of the distribution as those scores that have >95% probability of belonging to 
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the log-normal component. Evolutionary couplings in this tail defined as high 
probability resulted in 26 residue pairs predicted as contacts between RodA and 
PBP2. 

Sequence and structure conservation analysis. The sequence conservation 
analysis shown in Fig. 3 and Extended Data Fig. 5 was computed using the ConSurf 
server“. In brief, a multiple sequence alignment of T. thermophilus RodA to its 
closest 150 homologues was generated using the HHMER algorithm provided by 
ConSurf, with conservation scores plotted in PyYMOL. Additionally, a multiple 
sequence alignment of RodA to 506 homologues from representative bacterial 
taxa was generated using a protein sequence BLAST search on the NCBI public 
database using T: thermophilus RodA protein sequence as query, and plotted in 
Extended Data Fig. 3. 

Homology modelling and mutability index. A homology model of B. subtilis 
RodA based on the structure of T: thermophilus RodA was constructed using 
MODELLER™. In brief, owing to the low sequence identity (~26%) between the 
two enzymes, a multiple sequence alignment of 10 RodA homologues from diverse 
bacterial taxa was performed (T. thermophilus, B. subtilis, E. coli, Staphylococcus 
aureus, Streptococcus pneumoniae, Shigella flexneri, Haemophilus influenzae, 
Deinococcus marmoris, Bacillus safensis and Lysinibacillus odysseyi). The resulting 
alignment was used as the search template in MODELLER, in which disordered 
residues 204-256 and 266-276 were omitted from the model generation. A total 
of ten models were generated and the top-scoring model was chosen for further 
analysis. 

The mutability index of each residue in the homology model of B. subtilis 

RodA was calculated based on previously reported MutSeq data’. First, for each 
residue the nonsynonymous mutation rate was defined as the number of mutations 
observed after excluding synonymous and nonsense mutations. To calculate a raw 
mutability index, the nonsynonymous mutation rate was divided by the summed 
rate of silent mutations observed for a given nucleotide position. Finally, the raw 
mutability index for each residue was normalized with respect to the expected 
silent mutation rate for each general class of nucleotide change, resulting in a 
per cent mutability index for each residue. A low mutability index (that is, low 
percentage) represents a residue that is relatively intolerant of mutation at that 
position and a high mutability index (high percentage) represents a residue that is 
relatively tolerant of changes at that position. 
RodA mutant plasmid and strain construction. Mutants of E. coli RodA were 
introduced using QuikChange mutagenesis into a plasmid containing the allele 
Piac-pbpA-rodA(WT) (pHC857)!”. Wild-type E.coli cells (strain TB28) were then 
transformed with the mutant plasmids to generate strains that each harboured 
mutations in the central cavity (E114A, K117A, E114K/K117E and D159A) and 
at the RodA-PBP2 interface (L240S, $326A and L281A). 

Mutations of B. subtilis RodA were introduced using QuikChange mutagenesis 

into the pER174a plasmid (amyE::P,,-rodA(WT)-10 xHis)!. The resulting muta- 
genic plasmids (amyE::P,,)-rodA (allele)-10 x His) was directly transformed into the 
B. subtilis strain (rodA::kan, Pspank-rodA(WT)-10 XHis, and PpensacA::mCherry) to 
generate strains each harbouring mutations in the central cavity (Q114A, E117A, 
K120A, E117K/K120E, D280N and S364A), the proposed lipid-binding cavities 
(1113A, 1113S, and F118A and A171L), and the RodA-PBP2? interface (F292A). The 
B. subtilis strains harbouring amyE::P,,-rodA-(D167A), amyE::P,-rodA(D167N), 
and amyE::P,-rodA(D280A) have previously been described!. 
Mutational analysis in vivo. All E. coli cultures were grown at 37°C in M9 minimal 
medium supplemented with 0.2% (w/v) maltose, 0.2% (w/v) casamino acids, and 
25g ml! chloramphenicol. Overnight cultures were diluted to an OD¢00 of 0.05 
and grown to OD¢09 = 0.25 in the absence of inducer. These cultures were then 
further diluted to an OD600 of 0.005 in medium containing 1 mM IPTG to induce 
expression of the pbpA-rodA construct. Cells were grown for five generations 
in the presence of inducer and fixed when the OD¢o9 reached between 0.15 and 
0.2. Fixative solution contained 0.04% glutaraldehyde, 4% formaldehyde, 32 mM 
sodium phosphate, pH 7.5. Wide-field phase-contrast microscopy was performed 
ona Nikon TE2000 microscope equipped with a 100 Plan Apo 1.4 NA objective 
and a CoolSNAP HQ2 monochrome camera. 

B. subtilis strains were derived from the PY79 prototrophic strain**. Cells were 
grown in LB medium at 37°C in the presence of 101M IPTG to express wild-type 
RodA. When cultures reached mid-log, cells were washed three times with plain 
LB medium and resuspended in 20 ml of LB medium to an OD¢0p of 0.02, then 
supplemented with 10 mM xylose to induce expression of RodA wild type and 
variants. Cells were analysed by phase-contrast microscopy 90 min later. 
Microscopy was performed on a Nikon Ti microscope equipped with Plan Apo 
100/1.4NA phase-contrast oil objective and a CoolSNAP HQ2 camera. Cells were 
immobilized using 2% agarose pads, containing growth medium. Images were 
cropped and adjusted using MetaMorph software (Molecular Devices). 

Mutant analysis in vitro. Mutants of B. subtilis RodA were transformed into 
E. coli strain CAM333, a derivative of strain C43 with deletions in ponB, pbpC and 
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mgtA, and were expressed and purified by immunoaffinity chromatography as 
previously described’. 

Before circular dichroism spectroscopy experiments, each RodA mutant was 
dialysed into a buffer consisting of 10 mM sodium phosphate pH 7.5, 500 mM 
potassium fluoride, 0.5% (w/v) CHAPS, and 0.05% (w/v) DDM. Spectra were 
acquired on a Jasco J-815 spectropolarimeter. Circular dichroic spectra were 
recorded between 200-260 nm using a quartz cuvette with a path length of 1 mm, a 
50-nm/min scanning speed, and a bandwidth of 1 nm. Five spectra were measured 
at 25°C, averaged and corrected for buffer contribution. Secondary structure 
assessment was not performed owing to high absorbance contributions at wave- 
lengths less than 205 nm from the detergent mixture necessary for protein stability 
and function (0.5% CHAPS, 0.05% DDM). 

For assessment of enzymatic activity, lipid II substrate was purified from 
E. coli as described*’. Peptidoglycan polymerization reactions were adapted from 
previously described methods“. In brief, purified B. subtilis wild-type and mutant 
RodA proteins were incubated with lipid II in reaction buffer containing 50 mM 
HEPES pH 7.0, 20mM CaCh, 20mM MgCl, and 20% DMSO. The working con- 
centration of RodA and lipid H were 141M and 20\.M, respectively. RodA was 
purified in 0.5% CHAPS and 0.05% DDM, and therefore the working concen- 
tration for CHAPS and DDM was 0.05% and 0.005%, respectively. As a positive 
control lipid II was also polymerized with SgtB(Y181D) (11M) in reaction buffer 
containing 12.5mM HEPES pH 7.5, 2mM MnCh, 0.25 mM Tween-80 and 20% 
DMSO. All reactions were incubated at 25°C for 1h and quenched by incuba- 
tion at 95°C for 2 min. Peptidoglycan biotinylation of each reaction mixture was 
performed by addition of biotinylated p-lysine (1.5 mM, final concentration) and 
PBP4 (3.8 1M, final concentration) followed by incubation at 25°C for 1h. The 
biotinylation reaction was then quenched by addition of 1311 2x SDS loading 
dye. The samples were then loaded into a 4-20% gradient polyacrylamide gel and 
run at 180 V. The products were transferred onto a PVDF membrane (BioRad) and 
fixed in 0.4% paraformaldehyde diluted in PBS for 30 min at room temperature. 
The membrane was blocked with SuperBlock TBS blocking buffer (Thermo Fisher 
Scientific) for 1h at room temperature and the biotinylated products were detected 
by incubation with fluorescently tagged streptavidin ([RDye 800-CW streptavidin 
(Li-Cor Biosciences, 1:5,000 in SuperBlock)) for an additional 30 min. Membranes 
were washed 4 x 10 min with TBST (0.01% Tween 20) and given a final 10-min 
wash in PBS. Blots were then visualized using an Odyssey CLx imaging system 
(LI-COR Biosciences). 

Code availability. The full EVfold software package is available at https://github. 
com/debbiemarkslab/EVcouplings. 

Data availability. Structure factors and refined atomic coordinates for RodA wild 
type and the RodA(D255A) mutant are deposited in the RCSB Protein Data Bank 
under accession codes 6BAR and 6BAS, respectively. 


23. Caffrey, M. & Cherezov, V. Crystallizing membrane proteins using lipidic 
mesophases. Nat. Protoc. 4, 706-731 (2009). 

24. Kabsch, W. XDS. Acta Crystallogr. D 66, 125-132 (2010). 

25. Johnson, L. S., Eddy, S. R. & Portugaly, E. Hidden Markov model speed heuristic 
and iterative HMM search procedure. BMC Bioinformatics 11, 431 (2010). 

26. Suzek, B. E., Wang, Y., Huang, H., McGarvey, P. B. & Wu, C. H. UniRef clusters: 

a comprehensive and scalable alternative for improving sequence similarity 
searches. Bioinformatics 31, 926-932 (2015). 

27. Ekeberg, M., Lovkvist, C., Lan, Y., Weigt, M. & Aurell, E. Improved contact 
prediction in proteins: using pseudolikelihoods to infer Potts models. Phys. Rev. 
E 87, 012707 (2013). 

28. Balakrishnan, S., Kamisetty, H., Carbonell, J. G., Lee, S.-l. & Langmead, C. J. 
Learning generative models for protein fold families. Proteins 79, 1061-1078 
(2011). 

29. Toth-Petroczy, A. et al. Structured states of disordered proteins from genomic 
sequences. Cell 167, 158-170.e12 (2016). 

30. Jones, D. T. Protein secondary structure prediction based on position-specific 
scoring matrices. J. Mol. Biol. 292, 195-202 (1999). 

31. Kall, L, Krogh, A. & Sonnhammer, E. L. L. Advantages of combined 
transmembrane topology and signal peptide prediction—the Phobius web 
server. Nucleic Acids Res. 35, W429-W432 (2007). 

32. Brunger, A. T. Version 1.2 of the Crystallography and NMR system. 

Nat. Protoc. 2, 2728-2733 (2007). 

33. Bringer, A. T. et al. Crystallography & NMR system: a new software suite for 
macromolecular structure determination. Acta Crystallogr. D 54, 905-921 
(1998). 

34. Marks, D. S., Hopf, T. A. & Sander, C. Protein structure prediction from 
sequence variation. Nat. Biotechnol. 30, 1072-1080 (2012). 

35. Bunkéczi, G. & Read, R. J. Improvement of molecular-replacement models with 
Sculptor. Acta Crystallogr. D 67, 303-312 (2011). 

36. Schwarzenbacher, R., Godzik, A., Grzechnik, S. K. & Jaroszewski, L. The 
importance of alignment accuracy for molecular replacement. Acta Crystallogr. 
D 60, 1229-1236 (2004). 

37. Schrodinger. The PYMOL Molecular Graphics System, Version 1.3r1 
(Schrodinger, 2010). 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


38. 
39. 
40. 
4l. 
42. 
43. 


Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta 
Crystallogr. D 60, 2126-2132 (2004). 

Afonine, P. V. et a/. Towards automated crystallographic structure refinement 
with phenix.refine. Acta Crystallogr. D 68, 352-367 (2012). 

Chen, V. B. et al. MolProbity: all-atom structure validation for macromolecular 
crystallography. Acta Crystallogr. D 66, 12-21 (2010). 

Morin, A. et al. Collaboration gets the most out of software. eLife 2, e01456 
(2013). 

The UniProt Consortium. UniProt: the universal protein knowledgebase. 
Nucleic Acids Res. 45, D158-D169 (2017). 

Pakseresht, N. et al. Assembly information services in the European Nucleotide 
Archive. Nucleic Acids Res. 42, D38-D43 (2014). 


44. 


45. 
46. 


47. 
48. 


Ashkenazy, H., Erez, E., Martz, E., Pupko, T. & Ben-Tal, N. ConSurf 2010: 
calculating evolutionary conservation in sequence and structure of proteins 
and nucleic acids. Nucleic Acids Res. 38, W529-W533 (2010). 

Webb, B. & Sali, A. Comparative protein structure modeling using MODELLER. 
Curr. Protoc. Protein Sci. 54, 5.6.1-5.6.37 (2016). 

Youngman, P. J., Perkins, J. B. & Losick, R. Genetic transposition and insertional 
mutagenesis in Bacillus subtilis with Streptococcus faecalis transposon Tn917. 
Proc. Natl Acad. Sci. USA 80, 2305-2309 (1983). 

Qiao, Y. et al. Lipid Il overproduction allows direct assay of transpeptidase 
inhibition by 6-lactams. Nat. Chem. Biol. 13, 793-798 (2017). 

Qiao, Y. et a/. Detection of lipid-linked peptidoglycan precursors by exploiting an 
unexpected transpeptidase reaction. J. Am. Chem. Soc. 136, 14678-14681 (2014). 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


Cytoplasmic 


Viewed from 
extracytoplasmic side 


as z pee Reee ts 


Extended Data Figure 1 | Representative electron density. a-c, Simulated _ water molecules, respectively. The modelled water molecules are shown 
annealing composite omit 2F, — F, electron density map of T. thermophilus as red spheres. e, The same map contoured at 1.0c within a 3.0 A radius 
RodA contoured at 1.00 within a 2.0 A radius of atoms shown. d, The RodA (shown in blue) and the same map contoured at 1.00 for monoolein 
same map contoured at 1.00 and coloured blue and green for RodA and (shown in black). 
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Extended Data Figure 2 | Comparison of RodA wild type and structure is shown as a sphere, and coloured pink (wild type) or yellow 
RodA(D255A) structures. Structures of wild-type (green) and D255A (D255A). The dashed lines represent the disordered residues 189-227 and 
(blue) RodA are shown viewed parallel to the membrane (left) and from 237-251 in both structures. The two structures are essentially identical, 
the extracytoplasmic side (right). The C,, atom of residue 255 for each with a C, rm.s.d. of 0.1 A. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


10 40 50 
| I 1 1 
Thermus thermophilus = ------ eee e SPA PDP--------- VLLYROSVALGLGL 
Escherichia coli yee o Posie nd fey\SGQDIGMMER-------- KIGQIAMGL 
Bacillus subtilis 9 ----- MSKYKKOQSPFYOGU 1150) er nS} YGNTDWIQOLIVFYLLGA 
Staphylococcus aureus METSTR- cea taifks YETNFLIKQVLYYVLGF 
Streptococcus pneumoniae Gee saaer ee fi taifks Is YSANFGIRQLIYYILGA 
Deinococcus geothermalis = ---------------- MS LOWVIAQVLIELT LGLLGWAMANA P DK IT LDHGS----------- KALLAL 
Enterococcus faecalis ----MNRKEKTNLDSRIBY GIL PYF LIES L.LGML SI§Y VIAL YNDPSKP--KIGSLLMKOGLWYLVGG 
ca 7 @ % , , ] 
Thermus thermophilus LIAPELROFESRRREFGLAYPOMGAS ---iliD LV -REIN-- 
Escherichia coli VIMVVMAOTPPRVYEGWAPY LMI DC---fiT DAF@AIS---k 
Bacillus subtilis VALTVLLY FDLEQUEKLS LY OMI G1 LSURIGMEK 1SPES-IAPVI-K| 
Staphylococcus aureus TLATALMFITS PKTLRKYVWI IMI TGNLS THWPESSFTPII-N 
Streptococcus pneumoniae TLALTVMIFSPKKIKNNTYIWMTF FCT LLG TMPETPTTPTI- 
Deinococcus geothermalis LVYTFLAARLRPQAFLKIAPY FINGVT -—-IBL LTLFIGQGTEISS 
Enterococcus faecalis LSTVIIMHFSSKLLWRLT PVEWMALG-L LKFYDPVLAEQ-T| 


Thermus thermophilus 


Escherichia coli WAAC 5 FL 
Bacillus subtilis TAM Om DIAG TENG EM 
Staphylococcus aureus 1 ») 5 
Streptococcus pneumoniae Q G ILM 
Deinococcus geothermalis : F 3 : y TFGLGT TAMIYA 
Enterococcus faecalis ] 0 5 FLAT FGGYF LM 
i 
Thermus thermophilus PWRHELVGLFALALLV P------------------- 


SWRLILGVAVVLVAAF TI PILWFF---- 
NWKLTATTAGSGTLLTS-LILLVMINFP-DVAKSVG— SASNETOE 
TWRILAPAFTALFGIGS-STILLMIFKPSVIEKVSG— YAY S—— 
TWRILAPIFITVAVLGS-SIILAITYKPSLIEKTLH-— YAYS-— 
RULING TGEVLALGLIAIPRPLGR==S—s<-ssS5S=5-5- YLEKHSMILE FF GH] RGETETV 
TWKILV PVE I LAALVGAGT LYLITTETGRDLLSKLG------ WEAMKE FHT D== 


Escherichia coli 

Bacillus subtilis 
Staphylococcus aureus 
Streptococcus pneumoniae 
Deinococcus geothermalis 
Enterococcus faecalis 


250 ae 


Thermus thermophilus ia 
Escherichia coli — 


1s th ophil Q 
1 ili DS|POWDI 
Staphylococcus aureus -SGD nT 
Streptococcus pneumoniae -NGD Li 


Bacillus subtilis DSN 
Deinococcus geothermalis = G---LDO[JGMAHRD LISEC GUIs 
Enterococcus faecalis -PDRSBOPALAL THREE F GMESNV SD----VERBVRES 


300 310 

! i] 
Thermus thermophilus LFA ILACPRL SDRIRFLSGFAGMLGFOVV| 
Escherichia coli ee AARAQTT-------- 2 ae ee 
Bacillus subtilis a omic Hoc SFFCVGYTALIVIHTFQ 
Staphylococcus aureus LEG@HLIRMATTTRHP —-NVJF I 1GYLSEVLFHVVQ 
Streptococcus pneumoniae LIFHLIRLAMKTTSS FLIGYISLLVFHELQ 
Deinococcus geothermalis WLS TALHWSQLATRVRPMTPQIHGAT TGAMFMVVGQAF) 
Enterococcus faecalis LIYRMIRVCFDTNNE iY AYMATGHIMMILFHVFE| 


Thermus thermophilus ani ---------------- 

Escherichia coli SHJH---THRKMLSKSV--------— 

Bacillus subtilis YNASVOQLTKYRSYLFNS--------- 

Staphylococcus aureus  —- RAIBWSLMAGVg-----— ILLS TYYHEKEKFTESDQKADI 

Streptococcus pneumoniae Eg ogWoLNEG) eT ~~ VILSTYYHQKPASKDA-~~—~--— 

Deinococcus geothermalis HSAMREVRRHLPAAEASPDLVPVAAD 

Enterococcus faecalis = EV ERNE VE LIMSMRYQQETVRIRSGR--—-— 
Extended Data Figure 3 | RodA sequence conservation. The results of respectively. Secondary structure elements are shown above the alignment 
an alignment of 506 RodA sequences from diverse bacterial taxa, with on the basis of the T: thermophilus RodA crystal structure and JPRED 
representative examples displayed. Residues with 98%, 80% and 60% analysis of the portions of ECL4 that were not modelled in the structure. 


similarity across all 506 sequences are shown in black, grey and light grey, 
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Extended Data Figure 4 | RodA evolutionary couplings. a, The refined couplings, respectively. b, The partially disordered ECL4 in the refined 
crystal structure of T. thermophilus RodA (shown in light blue) is in close structure is strongly coupled evolutionarily to the transmembrane 
agreement with its evolutionary couplings. Green, yellow and red lines domain of RodA. A representation of predicted intra- and inter-domain 
between residues represent regions of the structure that are less than 5 A, evolutionary couplings for ECL4 is mapped onto a EVfold model (shown 
between 5 and 10A, and greater than 10 A of the predicted evolutionary in grey). 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


Viewed from 
intracellular side 


R oa 
lle340 


Pa 
Ala341 
Leu5 


> Ala344_/ 
a 
LeuS7— 


90° 


LETTER 


Viewed from 
extracytoplasmic side 
Variable Average Conserved 


(Ss 
Gin105 


SS 


~ GIn310) 
Cytoplasmic 


Viewed from 
extracytoplasmic side 


Extended Data Figure 5 | Sequence conservation of T. thermophilus represent disordered residues 189-227 and 237-251 in ECL4. b, The 
RodA. a, Surface and ribbon representation of RodA (top and bottom, bound lipid (shown as spheres) is surrounded by many aliphatic amino 
respectively). Analysis was performed using Consurf, coloured in a scale acids (shown as sticks). c, Extracytoplasmic view of the water-filled central 
from teal (poorly conserved) to magenta (highly conserved). A bound cavity and its proximity to the bound lipid molecule. 


lipid modelled as monoolein is shown in yellow sticks. The dashed lines 
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Extended Data Figure 6 | Circular dichroism spectroscopy analysis of For each panel, the circular dichroic spectra of wild-type and the 
purified B. subtilis RodA variants. Circular dichroism measurements indicated mutant RodA are shown in black and blue, respectively. The 
of wild-type RodA as well as RodA(D280A), RodA(E117K/K120E) corresponding T. thermophilus numbering for each mutant is shown in 
and RodA(I262S) indicate that the overall folds of all four forms are parentheses. Experiments were repeated independently twice with similar 
comparable and display characteristic «-helical peaks at 208 and 222 nm. results. 
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Extended Data Figure 7 | Mutagenic analysis of B. subtilis RodA 
function in vivo. Micrographs of B. subtilis strains harbouring an IPTG- 
inducible allele of wild-type rodA, and wild-type or mutant P,y14-rodA. 
Expression was induced by growing cells in the presence of 10\1M IPTG 
and 10mM xylose. The bacterial cytosol is indicated by intracellular 
mCherry expression (Ppen-mCherry). Mutants in the central cavity show 
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particularly deleterious phenotypes in this dominant negative assay. 
Experiments were repeated independently 2-3 times with similar results. 
A mutation made in the predicted RodA-bPBP interface does not prevent 
peptidoglycan polymerization in vitro (lower left panel), representative of 
two independent experiments. 
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Extended Data Figure 8 | Evolutionary coupling analysis for the 

RodA-PBP2 complex. Evolutionary co-variation maps highlighting 693 
couplings for PBP2 (top left panel), and 362 couplings for RodA (bottom 
right panel) using a 95% confidence threshold. These maps display good 
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correlation to the crystal structure of RodA and homology models of 
PBP2. The top right and lower bottom panels depict the 19 predicted inter- 
protein contacts between RodA and PBP2 using the same 95% confidence 
threshold. 
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Extended Data Table 1 | Data collection and refinement statistics 


T. thermophilus RodA 
Wild-type 


T. thermophilus RodA 
D255A 


Data collection 
Space group 
Cell dimensions 


a, b, c (A) 


a, B, y (°) 
Resolution (A) 
Reym (%) 
<I /o (l)> 
CC 1/2 (%) 
Completeness (%) 


Multiplicity 


Refinement 

Resolution (A) 

No. reflections 

Rwork ! Riree (%) 

No. atoms 
Protein 
Solvent ions/lipids 
Water 

Average B-factors (A’) 
Protein 
Lipids 
Solvent ions 
Waters 

R.m.s. deviations 
Bond lengths (A) 
Bond angles (°) 


Ramachandran Statistics 


Favored (%) 
Allowed (%) 
Outliers (%) 


C2 


122.4, 80.0, 47.8 


90.0, 91.1, 90.0 
40.0 - 2.9 (3.1 -2.9)* 
10.6 (147.9) 
6.69 (0.69) 
99.7 (45.8) 
98.4 (98.8) 
3.02 (3.15) 


40.0 - 2.91 (2.99 - 2.91) 
18684 (1851 in test set) 
22.9 / 27.4 


2229 
44 
52 


104.2 
109.6 
105.4 
87.6 


0.005 
0.825 


94.5 
5.5 
0.0 


Each dataset was collected from a single crystal. 
«Values in parentheses are for the highest-resolution shell. 
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C2 


121.2, 79.2, 47.4 
90.0, 91.1, 90.0 
40.0 - 3.2 (3.4 - 3.2)* 
17.1 (204.9) 
4.08 (0.44) 
99.6 (41.7) 
99.6 (99.5) 
3.43 (3.35) 


40.0 — 3.19 (3.31 — 3.19) 
13756 (1360 in test set) 
27.7 / 30.2 
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Structure of the insulin receptor-insulin complex 
by single-particle cryo-EM analysis 
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The insulin receptor is a dimeric protein that has a crucial role 
in controlling glucose homeostasis, regulating lipid, protein and 
carbohydrate metabolism, and modulating brain neurotransmitter 
levels'”. Insulin receptor dysfunction has been associated with many 
diseases, including diabetes, cancer and Alzheimer’s disease!**, The 
primary sequence of the receptor has been known since the 1980s”, 
and is composed of an extracellular portion (the ectodomain, ECD), 
a single transmembrane helix and an intracellular tyrosine kinase 
domain. Binding of insulin to the dimeric ECD triggers auto- 
phosphorylation of the tyrosine kinase domain and subsequent 
activation of downstream signalling molecules. Biochemical and 
mutagenesis data have identified two putative insulin-binding sites, 
S1 and $2°. The structures of insulin bound to an ECD fragment 
containing S1 and of the apo ectodomain have previously been 
reported”®, but details of insulin binding to the full receptor and 
the signal propagation mechanism are still not understood. Here 
we report single-particle cryo-electron microscopy reconstructions 
of the 1:2 (4.3 A) and 1:1 (7.4. A) complexes of the insulin receptor 
ECD dimer with insulin. The symmetrical 4.3 A structure shows 
two insulin molecules per dimer, each bound between the leucine- 
rich subdomain L1 of one monomer and the first fibronectin-like 
domain (FnIII-1) of the other monomer, and making extensive 
interactions with the a-subunit C-terminal helix (a-CT helix). 
The 7.4A structure has only one similarly bound insulin per 
receptor dimer. The structures confirm the binding interactions at 
S1 and define the full S2 binding site. These insulin receptor states 
suggest that recruitment of the a-CT helix upon binding of the first 
insulin changes the relative subdomain orientations and triggers 
downstream signal propagation. 

The insulin receptor is a dimer of heterodimers that comprises two 
a-chains and two 3-chains°, represented as (3). The a-chain and 
approximately 190 residues at the N-terminus of the 3-chain are located 
on the extracellular side of the plasma membrane (Fig. 1a) and together 
constitute the full insulin receptor ECD. The remainder of the 3-chain 
includes a transmembrane helix, the juxtamembrane domain and the 
intracellular tyrosine kinase domain. Structures of the individual sub- 
domains that make up the ECD (leucine-rich (L)1 and L2, cysteine-rich 
(CR) and fibronectin type III domains FnII-1, FnIII-2 and FnllI-3) 
have been resolved, and the crystal structure of the apo ECD$ provides 
one view of their quaternary organization. In the apo ECD structure, 
density for the insertion domain (120 residues located within FnIII-2, 
containing the cleavage site that generates the a- and 6-chains and 
the «-CT helix) was poor, and only a portion of the «-CT helix was 
visible?'°. As only one (a3) heterodimer is present in the asymmetric 
unit of the insulin receptor crystal structure, a symmetry-generated 
tetramer, (a3), has been hypothesized as representing the biologi- 
cally relevant form of the ECD®. Hereafter, we refer to each individual 


(a8) heterodimer as the insulin receptor monomer, and to the (a8) 
tetramer as the insulin receptor dimer. 

Current models of insulin binding to the insulin receptor suggest that 
there are two interaction sites on insulin that engage with two binding 
sites ($1 and $2) on the receptor’’. Binding is thought to occur in trans, 
with each insulin molecule interacting with S1 of one insulin receptor 
monomer and $2 of the other monomer. The bridging of these receptor 
binding sites constitutes the high-affinity interactions, whereas low- 
affinity interactions arise from single-site occupancy. The high-affinity 
interaction has a Kg of 6-200 pM for the solubilized and affinity- 
purified full-length receptor, whereas the low-affinity interactions have 
Kg values of 6nM (for $1) and 400.nM (for $2)!*. Upon insulin binding 
to the full-length receptor, there are multiple binding events and disso- 
ciations and/or multiple rearrangements within low- and high-affinity 
binding poses'”. Receptor activation is caused by high-affinity binding; 
however, the activation curve is complex and shows anti-cooperative 
binding’’. 

Here we report the 4.3 A resolution structure of the insulin recep- 
tor ECD in the presence of insulin, obtained using single-particle 
cryo-electron microscopy (cryo-EM) (Fig. 1b, c). A pronounced 
preferred orientation of the complex in vitreous ice was overcome 
with a combination of tilted data collection" and fast plunge speeds 
using the Spotiton instrument!'*. Further details are provided in the 
Methods. Three-dimensional classification of the cryo-EM data iden- 
tified a dimeric structure with two populations, in an approximately 
4:1 ratio (Fig. 2, Extended Data Fig. 1). The larger of these popula- 
tions, class 1, is characterized by a symmetrical ‘head’ and a poorly 
resolved, apparently flexible ‘stalk. Refinement of the class 1 structure, 
after masking out flexible portions of the stalk and applying C2 symme- 
try, provided a reconstruction to 4.3 A (Extended Data Figs 1, 2). The 
electron density map enabled unambiguous identification of the L1, 
CR, L2 and FnII-1 receptor domains, as well as the clear positioning 
of the insulin molecule and the a-CT helix (Fig. 1b, c). Refinement of 
the data using Cl symmetry produced a lower resolution map (4.7 A; 
Extended Data Fig. 3) in which additional density, assigned to FnIII-2, 
was visible for one monomer (Extended Data Figs 4a, 5b). 

The class 1 structure shows two insulin molecules, symmetrically 
bound, per dimeric receptor. Although asymmetric binding was pre- 
dicted for the full-length receptor’, studies show that the soluble ECD 
binds two molecules of insulin, each with a Kg of 3.5nM and a fast 
dissociation rate'”. The symmetrical binding of both insulin molecules 
observed in the cryo-EM map is consistent with these data. Each insulin 
molecule is located between the L1 domain of one receptor monomer, 
the FnIII-1 domain of the other monomer, and the «-CT helix (Fig. 1c). 
Residues of the L1 subdomain and the a-CT helix, previously identified 
biochemically as essential for insulin binding'’, have been hypothesized 
to represent the S1 site!®. Structures of ‘S1 microreceptors’ (containing 
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Figure 1 | Structure of the insulin receptor dimer. a, Domain 
organization of the full-length insulin receptor. The inter-monomer 
disulfide bonds**”? are shown as black lines; the intra-monomer ‘signalling 
bridge’*® is shown as an orange line. TM, transmembrane domain; 

TK, tyrosine kinase domain. b, Cryo-EM density map for class 1 with the 
insulin receptor subdomains fitted to the density; one monomer is shown 
in yellow, the other is colour coded as in a. Density indicated by the red 
arrows can be attributed to the insulin and «-CT helix. c, Close-up of 
density with insulin and a-CT helix fitted. 


L1, CR and a portion of the a-CT) in complex with insulin or insulin 
mimetics’ provided the first atomic details of some key insulin receptor- 
insulin interactions (Extended Data Table 1). In these structures, insulin 
interacts directly with the L1 subdomain of the receptor through only a 
small region of the B helix, and most of the other residues in the receptor 
that have been identified as essential for insulin binding (Extended Data 
Table 1) interact with the a-CT helix. Predicted interactions involving 
the C-terminal end of the insulin 3-chain® were also structurally visu- 
alized (Extended Data Table 1). The cryo-EM structure confirms the 
engagement of S1, including the interactions involving the C terminus 
of the insulin 3-chain, which is now resolved. The C-terminal region 
of the B-chain in the bound form assumes a different conformation 
from that observed in the insulin-free form (for example, Protein Data 
Base (PDB) ID: 1ZNI; Extended Data Fig. 4c), allowing the 3-chain 
core residues Gly8-Cys19 to interact with the a-CT helix, confirming 
the so-called detachment model of insulin binding'®. Although it is 
not possible to determine which receptor monomer each a-CT helix 
originates from using the current map, the fact that each insulin 
molecule interacts with residues belonging to both monomers supports 
the trans-binding-mode hypothesis. 

Analysis of the cryo-EM structure shows that $2 (Extended Data 
Table 1) is defined by interactions between the insulin receptor 
sequences Pro495—Arg498 and Arg539-Asn541 (located within the 
FnILH-1 domain) and chain B of insulin (residues Gln4—Gly8 and 
His10). Earlier work suggested that these insulin residues interacted 
with the S2 binding site®, but no partner residues were assigned on the 
receptor. The present structure shows interactions between the a-CT 
helix residues Leu696-Lys703 and insulin receptor residues Gly346- 
Asn349 and Arg372-Tyr374 (in the L2 subdomain) and Arg498- 
Leu501 and Val570-Thr571 (in the FnIII-1 subdomain). Previous work 
had correctly predicted that residues in L2 and FnIH-1 make up the S2 
site”. Clinical mutations (S323L, F382V, K460E and N462S, all located 
within or near the L2 and FnIII-1 subdomains discussed above) result 
in impaired insulin binding or reduced signalling”!-**. Further evidence 
that these regions represent S2 comes from ref. 25, in which a truncated 
receptor formed from the L1, CR, L2 and FnIII-1 subdomains fused 
to the «-CT helix was shown to have insulin-binding properties that 
are identical to those of the full ECD. A second possible location for 
the S2 site (at the junction of the FnIII-1 and FnIII-2 domains) was 
hypothesized from the crystallographic structure of the insulin-free 
insulin receptor ECD dimer®. Mutagenesis data provided some support 
for this assignment! 1. however, in the un-symmetrized map (Extended 
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Figure 2 | Class 1 and class 2 dimers. a, Side and top views of the two 
classes identified during 3D reconstruction. b, Cryo-EM map for class 2, 
showing density for the a-CT helix and insulin only in one monomer. 


Data Figs 4a, 5b) in which the FnIII-2 is visible, the bound insulin is 
more than 50 A from this predicted S2 site (Extended Data Fig. 4b), on 
the opposite side of the receptor. 

Refinement of the structural data for the smaller class 2 produced 
a74A map (Fig. 2, Extended Data Figs 1, 6). In this map, half of the 
‘head’ has an overall conformation similar to that observed in the class 1 
map, with visible densities for the a-CT helix and insulin. The other half 
showed the L1-CR-L2 portion in a different conformation (Fig. 2a): 
more open, and characterized by less-well-defined density. The model 
of the insulin receptor based on the class 1 structure was rigid-body 
fitted onto the class 2 map, and only one insulin molecule and one 
a-CT helix were included in the final structure. The open side 
lacked both the insulin and the helix (Fig. 2b). This conformation 
therefore comprises both an insulin-bound receptor monomer and an 
insulin-free receptor monomer. Comparison of these structures sug- 
gests that insulin binding induces both the closing of the top portion 
of the receptor (by rigid-body motion of the L1-CR-L2 portion with 
respect to the FnIII-1 domain) and recruitment of the a-CT helix, 
contrary to a previously suggested mechanism in which insulin docks 
to a preformed ‘harbour’ containing both the L1 and a-CT elements 
that are required for binding”®. 

The insulin receptor dimer identified in the cryo-EM analysis does 
not resemble the crystallographic symmetry-generated dimer®. The 
relative arrangement of the two monomers differs between the cryo-EM 
and crystallographic dimers, and transitioning between them would 
require major conformational changes and disruption of extensive 
surface interfaces, specifically the interactions between L1 and FnIL- 
2’ and their dimeric symmetry mates, and between L2, FnIII-1/ and 
their dimeric symmetry mates (Fig. 3a and Extended Data Fig. 7). 
However, the overall conformation of the crystallographic monomer 
(PDB ID: 4ZXB) is similar to that of the unbound monomer observed 
in the class 2 cryo-EM structure (Fig. 3b), suggesting that it could be 
a biologically relevant representation of the unbound insulin receptor 
monomer. Attempts to obtain a cryo-EM reconstruction for the ECD 
in the absence of insulin (the apo state) proved unsuccessful as the 
particles showed a high degree of heterogeneity and many were smaller 
than expected, suggesting that the ECD dimer may be unstable in the 
absence of the ligand. Comparing the unbound monomer from the 
crystal structure to the insulin-bound monomer from the cryo-EM 
structure (Extended Data Fig. 8), it appears that the conformational 
change that occurs upon insulin binding can be described as two 
rotations. The first is a 35° rotation of the L2-CR-L1 domains with 
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Figure 3 | Proposed transduction mechanism. a, Comparison between 
the cryo-EM (insulin bound) and the crystallographic (apo) dimers. Both 
dimers are shown as surface representation of the coordinates. Similar 
conformational differences between unbound and insulin-bound insulin 
receptor have been reported*”. b, Side and top views of the monomer 
(PDB ID: 4ZXB) overlaid onto the cryo-EM ‘ope’ monomer using the 
FnIII-1 domain. c, Schematic of a possible activation mechanism for the 
insulin receptor. The insulin receptor subdomains, colour coded as in 
Fig. 1a (solid colours for the a-chain and lighter colours with thicker outer 
lines for the 8-chain). Dotted lines, inter-monomer disulfide bonds; solid 
orange line, the signalling bridge’®. Binding of one insulin molecule to 
the apo receptor (left to middle panel) causes the LI-CR-L2 subdomains 
of one monomer, the FnIII-1 subdomain of the other and the a-CT helix 


respect to the linker connecting the L2 and FnIII-1 domains (resi- 
dues Ala466-E469); the second is a 55° swing of the CR-L1 pair (as 
a rigid body) around the Gly306-Lys310 linker between the CR and 
L2 domains (not resolved in the current structure). These domain 
movements are similar to the rotation and translation observed upon 
ligand binding in the related EGFR family’’. On the basis of the com- 
parison between unbound and bound receptors, it appears that the 
a-CT helix moves between approximately 55 A (if we consider the helix 
from the same monomer) and approximately 70 A (if we consider the 
a-CT from the other monomer). Figure 3c shows a schematic of a 
possible mechanism for insulin receptor transduction. It is likely that, 
following the binding of the first insulin molecule, the large shift in the 
position of the «-CT helix causes conformational changes in the inser- 
tion domain (ID)-a of both monomers. ID-« is disulfide bonded to 
FnIII-3 (Cys647—Cys860) and this connection (the ‘signalling bridge®) 
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Monomer from 4ZXB 
Unbound monomer from cryo-EM class 2 


to move to generate the binding site. The movement of the a-CT helix 
and the attached ID-«a causes, via the signalling bridge, a conformational 
change in the FnIII-3 domain. Because the two ID-a regions are also 
disulfide bonded, the movement of one is likely to be transmitted to the 
other, inducing a similar conformational change in the other FnIII-3 
domain. These changes propagate through the transmembrane helix to the 
tyrosine kinase domains, inducing autophosphorylation and activation 
of the signalling pathway. This state is seen in the cryo-EM class 2 map. 
Right, binding of a second insulin molecule recruits the second a-CT 
(cryo-EM class 1 map) and may fully stabilize the activated complex. 
Although the diagram suggests that the a-CT helix involved in insulin 
binding is the one from the same monomer (cis interaction), there is no 
evidence, to our knowledge, that rules out a trans interaction. 


may induce further conformational changes in the FnII-3 domain, 
thus triggering the downstream signal propagation. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Recombinant proteins. Recombinant human insulin receptor-CD220 (R&D 
systems, 1544-IR-50/CF) was reconstituted in PBS pH 7.4 at 0.3 mg/ml 
(0.00293 mM), stored in 100-1 aliquots at —80°C and used without further 
purification. The protein was tested for insulin binding using !*°I-insulin com- 
petition and surface plasmon resonance (Extended Data Table 2). The measured 
binding potency and binding affinities were consistent with values from the 
literature'”*!. A Coomassie-stained SDS-PAGE gel (Extended Data Fig. 5) of the 
sample shows the expected bands for the «- and 3-chains. The apparent molecular 
weights of both chains were higher than predicted (82.9 kDa for the a-subunit and 
22.9kDa for the 3-subunit), and the 8-subunit appears as three discrete bands. This 
has previously been observed!”>*"8 and can be attributed to glycosylation (and to 
different states of glycosylation for the 8 subunit). The gel confirms the integrity 
of the sample used in the studies. 

Recombinant human insulin (RHI) was obtained from the laboratories of 
Merck. RHI is a commercially available material; a generic version of the bioprocess 
used in preparing it has been described**. A stock solution (5.6 mg/ml, 0.969 mM) 
was made in HBS (10mM Hepes pH 7.4, 150mM NaCl) and stored at 4°C. 
Insulin receptor ECD competition binding assay. Purified His-tagged human 
insulin receptor ECD (R&D Systems) was bound to an anti-mouse IgG coated 
microplate via attachment to a mouse monoclonal anti-human insulin receptor 
capture antibody (R&D Systems). Varying concentrations of RHI were incubated 
with the insulin receptor ECD in the presence of 0.1 nM !*I-insulin (PerkinElmer, 
catalogue NEX420050UC) overnight at 4°C in binding buffer (100 mM HEPES, 
100mM NaCl, 10mM MgCh, 0.02% Triton X-100, pH 8). Plate washings with 
binding buffer were carried out after the antibody and protein capture steps and 
following ligand incubation. The amount of radioligand still bound to the insulin 
receptor ECD was determined with either a TopCount or a MicroBeta instru- 
ment using Microscint-40. Concentration response curves were generated and ICs 
values were calculated using a four-parameter fit. 

Insulin receptor ECD direct binding assay. Purified His-tagged human insulin 
receptor ECD (R&D Systems) was immobilized to a Biacore CM5 chip via an 
anti-His antibody kit (GE Healthcare Life Sciences) following the manufacturer's 
instructions. RHI binding to the immobilized human insulin receptor ECD was 
assayed by passing varying RHI concentrations in running buffer (1x HBS-EP; 
10mM HEPES, 150mM NaCl, 3mM EDTA, 0.005% P-20, pH 7.3) over the chip. 
General detection and data collection parameters for the Biacore T200 instrument 
were used to determine ko,, Korg and Kg values. 

Grid preparation. Samples for grid preparation were generated by mixing 10011 
insulin receptor solution and 3 1] insulin stock (for a final insulin receptor:insulin 
ratio of 1:10) followed by incubating on ice for at least one hour before making 
vitrified grids. Poor grid reproducibility and difficulty in obtaining suitable ice 
thickness presented issues when using manual grid preparation. To address 
these issues, the samples were vitrified using a semi-automated Spotiton V1.0 
robot!>!635, a device for preparing cryo-EM samples that uses piezo-electric 
dispensing to apply small (50-pl) drops of sample across a ‘self-blotting’ nanowire 
grid as it flies past en route to plunging into liquid ethane. The nanowire grids, 
manufactured in house, used either lacey carbon or lacey gold supports**; the gold 
supports help to optimize image quality when acquiring data with the grid tilted 
relative to the electron beam!*””. Tilted data collection was necessary to address 
highly preferred particle orientation observed during initial data collections. For 
the tilted datasets, nanowire gold substrate grids were plasma cleaned for 10s (with 
O, and H;). The sample was dispensed onto these grids in 50-pl drops for a total of 
~5nl sample dispensed in a stripe from top to bottom across each grid, before the 
grid was plunged into liquid ethane. The time between sample application to the 
grid and the plunge into ethane was typically ~300 ms. Subsequently, we modified 
the SpotItOn instrument to further reduce the time between sample application 
and plunge to ~170 ms. This modification was undertaken in an attempt to reduce 
the time the sample has to interact with the air—water interfaces in the thin liquid 
film before vitrification. These interactions are assumed to be the cause of preferred 
particle orientations*, and indeed the faster plunging speed resulted in grids that 
displayed markedly better particle distribution (data not shown), enabling the 
collection of data at zero-degree tilt. Nanowire-based, self-blotting grids with a 
lacey carbon supporting substrate were used for this purpose, following the same 
sample preparation methods described above. 

Data collection and processing. Tilted data were collected on a Titan Krios 
(Thermo Fisher) equipped with an energy filter and a Gatan K2 Counting camera. 
The microscope was operated at 300 kV and a nominal magnification of 105,000x, 
with a calibrated pixel size of 1.1 A. The defocus ranged from —1 to —2.5j1m. 
Images were collected at a tilt angle of —30° to address the issue of preferred orien- 
tation identified during initial data collection’. Exposure was set to 10s (40 frames 
per image), for a total dose of ~68e~ A~. A total of 6,805 images was collected 


in three sessions using Leginon”’. Of these, 3,056 images were selected as suitable 
for further data processing. Frames were aligned using MotionCor2"; global and 
per-particle CTF was calculated using gCTF"". Particle picking was done using 
Gautomatch (http://www.mrc-Imb.cam.ac.uk/kzhang/) and resulted in 1,206,222 
particles. All subsequent data processing (from 2D classification to final recon- 
struction) was done using Cryosparc™. After several cycles of 3D classification, 
151,409 particles in one class were used to generate a 4.6 A map with C2 symmetry 
(anda 5.6 A map with Cl symmetry). A Cl symmetry map was generated at 7.4A 
resolution from 48,315 particles in a second class (Extended Data Fig. 1). Fewer 
than 20% of the originally selected particles contributed to the final 3D maps. This 
was caused firstly by very liberal initial picking criteria that resulted in a high rate 
of false positives that were rejected during initial 2D classification (approximately 
60%). Secondly, there is a possibility that a substantial fraction of the receptor is 
not complexed, or alternatively, that there is a fast equilibrium between bound and 
unbound species that shows up as heterogeneity, limiting the contribution of these 
particle to the 3D structure. 

Untilted data were collected on a Titan Krios (Thermo Fisher) equipped with 
an energy filter and a Gatan K2 Counting camera. The microscope was operated 
at 300kV and a nominal magnification of 105,000 x, with a calibrated pixel 
size of 1.1 A. The defocus ranged from —1 to —2.5\1m. Exposure was set to 10s 
(40 frames per image), for a total dose of ~66 e~ A~. A total of 2,487 images was 
collected in three sessions using Leginon®’. Of these, 1,526 images were selected 
as suitable for further data processing. Frames were aligned using MotionCor2"; 
global and per-particle CTF was calculated using gCTF"". Particle picking was 
done using Gautomatch (http://www.mrc-lmb.cam.ac.uk/kzhang/) and produced 
348,930 particles. All subsequent data processing (from 2D classification to final 
reconstruction) was done using Cryosparc™. After several cycles of 3D classifica- 
tion, a set of 40,390 particles was identified, which produced a 4.7 A class 1 map 
after symmetrized homogeneous refinement. This set was merged with the 151,409 
particles from the tilted class 1 set. After a round of heterogeneous refinement, 
147,436 particles were used to generate a 4.3 A map with C2 symmetry (and a 
4.7 A map with Cl symmetry) (Extended Data Fig. 1). The resolution of each 
reconstruction was determined by the gold standard Fourier shell correlation (FSC) 
criterion in Cryosparc” and RELION PostProcess*. The completeness of the maps 
was assessed by measuring the Euler angle orientation distribution (as calculated 
by Cryosparc’”) and by evaluating the 3D FSC" of each map. (Extended Data 
Figs 2, 3, 6). The quality of the map was confirmed by the ability to see separated 
6-strands and the presence of bulky side chains (Extended Data Fig. 2f). 
Structure solution. Individual domains of the insulin receptor (L1, CR, L2 and 
FnIII-1, from PDB ID: 4XZB) were positioned into the initial 4.6 A resolution 
map using Molrep* and rigid-body refined using Coot’. The insulin and the 
a-CT helix were positioned by overlaying the microreceptor structure from PDB 
ID 3W11’ onto the L1 domain of the complex and rigid body fitting them into the 
available density. The a-CT helix was manually extended, while the full insulin 
molecule was built using as a reference one of the monomers from PDB ID 1ZNI. 
The full structure was subjected to five cycles of global real-space refinement with 
NCS, rotamer, Ramachandran plot and C-beta deviations restraints enabled in 
Phenix*®. This model was subsequently refined against the 4.3 A map following the 
same real-space refinement procedure in Phenix“®, The resulting model is identi- 
cal to the 4.6 A model, but for the addition of the first seven N-terminal residues 
(His1-Val7) to both receptor monomers, and contains residues His1-Asp591 and 
Lys691-Val720 (a-CT helix) of both of insulin receptor monomers and residues 
Gly1-Lys21 of chain A and Phe1-—Ala30 of chain B of both bound insulins. Sugars 
were added to residues Asn16, Asn25, Asn111, Asn255, Asn397 and Asn418 of 
both insulin receptor monomers. The sugars were extracted from PDB ID 4ZXB. 
All subsequent, lower resolution models were generated by manually positioning 
the higher resolution structure onto the available density and rigid-body refining 
the individual domains to their final position using Coot*®. Subsequently, the struc- 
tures were subjected to five cycles of global real-space refinement with rotamer, 
Ramachandran plot and C-beta deviations restraints enabled in Phenix“®. The 
lack of definition for the FnIII-2 and especially FnIII-3 domains may be due to the 
intrinsic flexibility of these regions. While one FnIII-2 region was clearly visible 
in the un-symmetrized map, unstructured density that could be attributed to the 
remaining missing regions can be seen by contouring the maps at very low sigma 
(Extended Data Fig. 5). 

Figure 2a and Extended Data Figs Ic, 2e, 3e, 6e were generated with Chimera?’. 

All other figures, unless otherwise specified, were generated with PyMOL (The 
PyMOL Molecular Graphics System v.1.8). 
Data availability. The maps and coordinates generated and analysed during the cur- 
rent study have been deposited with the Electron Microscopy Data Bank and Protein 
Data Bank under accession codes EMD-7462 (PDB 6CE9), EMD-7463 (PDB 6CEB) 
and EMD-7461 (PDB 6CE7) for the 4.3 A, 4.7 A and 7.4A maps, respectively. 
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Extended Data Figure 1 | Data collection and processing. 


a, Representative micrograph of the insulin receptor-insulin complex. 


Images were collected on a Krios (Thermo Fisher) equipped with an 
energy filter and a Gatan K2 Counting camera; the magnification was 
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set to 105,000 x, with a calibrated pixel size of 1.10 A.b, Representative 
2D class averages as calculated with Cryosparc™. c, Schematic of data 
processing. 
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Extended Data Figure 4 | Insulin binding and crystallographic S2 site. position of the bound insulin (blue) and the proposed S2 site (shown as 
a, Map of the class 1 structure obtained without applying C2 symmetry, spheres, residues selected according to ref. 26: the insulin is between 55 
with the insulin receptor model shown as a cartoon. The map is and 62 A away, and on the opposite side of the proposed S82). c, Overlay of 
asymmetric and only one of the FnIII-2 subdomains is clearly visible. free insulin (PDB ID: 1ZNI) onto the insulin bound to the insulin receptor. 


b, Positioning of the FnII-2 subdomain enables analysis of the relative 
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Extended Data Figure 5 | Insulin receptor sample characterization. was as reported by R&D Systems. For gel source data, see Supplementary 
a, Coomassie-stained SDS-PAGE gel. The protein was solubilized in PBS Fig. 1. b, Cryo-EM density for the un-symmetrized map (4.7 A resolution) 
at pH 7.2 and run on a 4-12% Bis-Tris gel in 1x MOPS. Molecular weight countered at 60. Density that can be attributed to the second FnIII-2 
markers are labelled. Bands corresponding to the «- and 8-chains are domain (blue arrow) as well as one of the FnIII-3 domains (red arrow) is 
indicated. This experiment was only run once to confirm sample quality visible, but it is not of sufficient quality for building the model. 
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Crystallographic apo dimer 


Extended Data Figure 7 | Comparison between the crystallographic 
and the cryo-EM dimer. a, The top panel shows three different views of 
the cryo-EM dimer, related by the 90° rotation shown in the figure. These 
are surface (for one monomer) and cartoon (for the other) representations 
of the coordinates fitted to the un-symmetrized, 4.7 A map. In this model 
one of the FnIII-2 subdomains is visible. Both monomers are coloured 
according to the subdomains (L1, blue; CR, red; L2, yellow; FnIII-1, green; 
FnlII-2, light blue). The bottom panel shows three corresponding views for 
the crystallographic dimer. One monomer is shown as surface, the other as 
a cartoon; both are coloured according to the subdomains (in the cryo-EM 
dimer, the additional FnIII-3 is coloured in light purple). The relative 
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arrangements of the two monomers in the cryo-EM and crystallographic 
dimers are completely different. b, If the crystallographic insulin receptor 
dimer (left panel) represents the biological apo insulin receptor dimer, the 
transition to the insulin-bound dimer (right panel) would imply a putative 
intermediate state, shown in the middle panel. In this state, the large 
conformational changes required to accommodate the a-CT helix (orange 
ribbon) and insulin (white ribbon), and to allow them to engage the S2 site 
(black spheres) would require disruption of the extensive surface interface 
between L1 and FnIII-2’ and 3’ and L2-FnIII-1’ interactions. In the middle 
and right panels the «-CT helix and the insulin are from the cryo-EM 
structure. 
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Extended Data Figure 8 | Comparison between the bound and unbound __ the insulin receptor monomer. This transition can be described as two 
insulin receptor monomer. Transition from the apo (magenta, PDB rotations with respect to the linker regions identified by the black arrows. 
ID: 4ZXB monomer) to the bound form (green, cryo-EM structure) for The a-CT helix is shown as a cartoon. 
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Extended Data Table 1 | Summary of the interactions between the insulin receptor and bound insulin 


: : IR L1 : IR L2 IR Fnill-1’ 
nisulin Kasi ates (IR monomer A) aie doadllaa (IR monomer A) | (IR monomer B) 
L62, F64, F88, F89, 
Y91, V94, F96, F701, F705 


R188 


po R14, 34,036, F88] 708] 


V3A, G8B, L11B, 
V12B DOF, FFU 


F714 


subsequently visualized in another study*®; green: new interactions identified in the cryo-EM map. 
Insulin residues predicted to belong to S2, with no IR partner assigned®. 
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Extended Data Table 2 | Human insulin receptor ECD binding and kinetic data 


['251]-Insulin Competition Biacore Binding: Biacore Binding: 
Binding: High Affinity Site Low Affinity Site 
IC55 (nM) Ka (nm Ka (nm) 


Biacore Binding: Biacore Binding: Biacore Binding: Biacore Binding: 
High Affinity Site High Affinity Site Low Affinity Site Low Affinity Site 
Kon (nM a) Kot ( s') Kon (nM s*') Kote ( s*') 


0.0016 0.0044 0.00061 ar ee 
(1.1) (1.1) (1.3) (1.3) 


Top, human insulin receptor ECD binding data for recombinant human insulin. Data are reported as the geometric mean for 23 replicates; the geometric standard deviation is shown in parentheses. 
The binding potency measured with the !25l-insulin competition assay agrees very well with values from the literature!”. Bottom, human insulin receptor ECD binding kinetic data for RHI. Data are 
reported as the geometric mean of three replicates; the geometric standard deviation is shown in parentheses. The surface plasmon resonance data are consistent with values from the literature*?. 
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Modular assembly of the nucleolar pre-60S 


ribosomal subunit 


Zahra Assur Sanghai!*, Linamarie Miller", Kelly R. Molloy*, Jonas Barandun!, Mirjam Hunziker!, Malik Chaker-Margot!?, 


Junjie Wang”, Brian T. Chait? & Sebastian Klinge! 


Early co-transcriptional events during eukaryotic ribosome 
assembly result in the formation of precursors of the small (40S) and 
large (60S) ribosomal subunits'. A multitude of transient assembly 
factors regulate and chaperone the systematic folding of pre- 
ribosomal RNA subdomains. However, owing to a lack of structural 
information, the role of these factors during early nucleolar 60S 
assembly is not fully understood. Here we report cryo-electron 
microscopy (cryo-EM) reconstructions of the nucleolar pre-60S 
ribosomal subunit in different conformational states at resolutions 
of up to 3.4 A. These reconstructions reveal how steric hindrance 
and molecular mimicry are used to prevent both premature folding 
states and binding of later factors. This is accomplished by the 
concerted activity of 21 ribosome assembly factors that stabilize 
and remodel pre-ribosomal RNA and ribosomal proteins. Among 
these factors, three Brix-domain proteins and their binding partners 
form a ring-like structure at ribosomal RNA (rRNA) domain 
boundaries to support the architecture of the maturing particle. 
The existence of mutually exclusive conformations of these pre-60S 
particles suggests that the formation of the polypeptide exit tunnel 
is achieved through different folding pathways during subsequent 
stages of ribosome assembly. These structures rationalize previous 
genetic and biochemical data and highlight the mechanisms that 
drive eukaryotic ribosome assembly in a unidirectional manner. 

The assembly of the large eukaryotic ribosomal subunit (60S) is 
organized as a series of consecutive intermediates, which are regu- 
lated by different sets of ribosome assembly factors in the nucleolus, 
nucleus and cytoplasm!. Three major types of pre-60S particles have 
been observed, each with a distinct composition of assembly factors, 
representing late nucleolar’, nuclear? and late nucleo-cytoplasmic 
intermediates*. The earliest pre-60S particles exist in the nucleolus, 
where they undergo major RNA processing steps and conformational 
changes. Subsequently, in the nucleoplasm, the internal transcribed 
spacer 2 (ITS2) RNA is removed and the particles are assembled into a 
nuclear-export-competent state. Exported particles then complete the 
final steps of maturation in the cytoplasm. 

While previous high-resolution structural studies have revealed the 
architectures of the late nucleolar and export-competent pre-60S par- 
ticles, the architecture of the early nucleolar particles, containing spe- 
cific factors such as Nsal, remains unknown”*. Although the identities 
and approximate binding regions of many early nucleolar ribosome 
assembly factors are known, their structures and functions have not 
yet been determined’. 

To elucidate the mechanisms that govern early nucleolar large-subunit 
assembly, several laboratories, including ours (data not shown), have 
observed that cellular starvation extends the lifetime of a 27SB pre- 
rRNA-containing species”. Here, we have isolated these intermediates 
from starved yeast using tandem-affinity purification involving tagged 
ribosome assembly factors Nsal and Nop2 (Extended Data Fig. 1). 


Similar to the small subunit processome, the nucleolar pre-60S par- 
ticle, which is compositionally related to Nsal-containing particles®, 
accumulates upon starvation. We analysed purified nucleolar pre-60S 
particles by cryo-EM, revealing a high-resolution core (3.4 A) that was 
further sub-classified into three states, which we structurally resolved at 
resolutions of 4.3 A (state 1), 3.7 A (state 2) and 4.6 A (state 3) (Table 1, 
Extended Data Figs 2, 3). These three states, together with cross-linking 
and mass spectrometry data, resulted in the identification of 21 ribo- 
some assembly factors (Extended Data Table 1 and Supplementary 
Dataset 1). Atomic models could be completed for 18 of these proteins, 
and homology and poly-alanine models were used for proteins such as 
Ebp2, Mak11 and Ytm1, which were located in more flexible regions 
(Fig. 1, Extended Data Fig. 4). 

The purified pre-60S particles contain 27SB rRNA that has not yet 
been processed at ITS2 (Extended Data Fig. 1). We observed three 
different conformational states of this RNA (Extended Data Figs 2, 5). 
State 1 includes ordered density for ITS2, domains I and H, and the 
5.88 rRNA. State 2 additionally revealed density for domain VI, which 
is present in a near-mature conformation. In contrast to state 2, state 3 
lacks an ordered domain VI but features domain III. Although present, 
the majority of domains IV and V and the 5S ribonucleoprotein (RNP) 
were poorly resolved in all of the reconstructions, owing to confor- 
mational flexibility. Low-resolution features corresponding to parts of 
domain V (helices 74-79) and its proximal assembly factor Mak11 can 
be seen in states 2 and 2A (Fig. 1, Extended Data Fig. 2). 

A striking feature of the nucleolar pre-60S particle is its open 
architecture, in which the solvent-exposed domains I, II and VI are 
encapsulated by a series of ribosome assembly factors as visualized in 
state 2 (Fig. 1, Extended Data Fig. 5, Supplementary Video). Notably, 
ribosomal proteins that have been associated with Diamond-Blackfan 
anaemia’ are located at critical rRNA domain interfaces in the structure 
(Extended Data Fig. 6), suggesting that their architectural roles are 
especially important during early nucleolar assembly, in which defects 
can trigger the nucleolar stress response. 

Domains I, I and VI adopt an open conformation that is chaperoned 
by eight early ribosome assembly factors, which form a ring-like struc- 
ture at the solvent-exposed side (Fig. 2a). In particular, Brix-domain- 
containing factors (Brx1, Rpfl and Ssf1) act in conjunction with their 
respective binding partners (Ebp2, Mak16 and Rrp15) to interconnect 
these junctions and sterically prevent premature RNA-protein and 
RNA-RNA contacts. Architectural support for the major interface 
between domains I and II is provided by Rpfl1 and its zinc-binding 
interaction partner Mak16, the helical repeat protein Rrp1 and the 
beta-propeller Nsal. Rpfl and Nsal occupy a region near the domain 
I binding site of Rpl17, while Mak16 and Rrp1 interface predominantly 
with ribosomal proteins Rpl4 and Rpl32 within domain II (Fig. 2b). 

The ring-like structure encapsulating domains I, II and VI is con- 
tinued in one direction by the Ssfl-Rrp15 heterodimer and Rrp14. 
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Table 1 | Cryo-EM data collection parameters and refinement and 
validation statistics 


State 2 State 3 
EMD-7324 EMD-7445 
Core PDB6COF State2A PDB6CB1 
Data collection and processing 
Magnification 22,500 x 
Voltage (kV) 300 
Pixel size (A) 13 
Electron exposure (e~ A?) 47 
Defocus range (j1m) 1.0-3.5 
Symmetry imposed C1 
Initial particle images 1,653,290 
Final particle images 514,746 201,114 75,512 31,419 


Resolution (A) 3.4 3.7 4.2 4.6 


FSC threshold 0.143 

Map sharpening B-factor (A?) 68.7 TL7 83.0 94.2 
Refinement 

Model composition 

Non hydrogen atoms 91,741 69,945 
Protein residues 7,197 6,347 
RNA bases 1,602 1,615 
Ligands 3 2 
rm.s.d. 

Bond length (A) 0.007 0.006 
Angles (°) 1.11 1.10 
Validation 

MolProbity score 1.91 1.68 
Clashscore 10.67 5.96 
Rotamer outliers (%) 0.30 0.13 
Good sugar puckers (%) 98 oF 
Ramachandran 

Favoured (%) 94.75 94.85 
Allowed (%) 5.18 5.02 
Outliers (%) 0.07 0.13 


While the long C-terminal helix of Rrp14 bridges domains I and 
VI, the Ssfl-Rrp15 complex is positioned at the interface of domains 
land VI (Fig. 2a). Here, Ssf1 occupies the same position as Rp131, 
which in the later Nog2-containing pre-60S particles binds at the 
interface of domains III and VI near the polypeptide exit tunnel 
(PET)*. The PET, which is created by domains I, III and VI at the 
solvent-exposed side, is already formed in the Nog2 particle, where 


a Back view 
Rrp1 


Figure 1 | Structure of the early nucleolar pre-60S particle. a, Composite 
cryo-EM density map of state 2, consisting of 25S rRNA domains I and II 
(3.4 A) and VI (3.7 A) (low-pass filtered to 5 A), and associated proteins. 
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it is blocked by the C-terminal domain (CTD) of the GTPase Nog] 
(Fig. 2c, d). 

The role of the Brx1-Ebp2 heterodimer near the interface of domains 
Iand II is twofold. As well as being involved in the stabilization of these 
domains, its strategic binding site prevents the premature assembly of 
the large subunit by steric hindrance. In later stages of large subunit 
assembly, an RNA segment of domain I (helix 22) base-pairs with a 
region in domain V (helix 88) near a separate region of domain IV 
(helix 68) (Fig. 2e, f). Brxl1 remodels helix 22 of domain I to block the 
premature formation of this tertiary structure with helix 88. Similarly, 
Brx1 prevents the mature conformations of domain IV (helix 68), 
domain II (expansion segment 9) and the C-terminal region of Rpl13 
in this region (Fig. 2e, f). 

In state 3, we identified the Erb1-Ytm1 heterodimer bound to 
domain III via Rpl27 (Fig. 3a). The N-terminal region of Erb1 (residues 
239-397) wraps around the entire ITS2-domain I interface and is posi- 
tioned underneath Nop16 and Has! (Fig. 3b, c). This location is con- 
sistent with previous cross-linking data and explains why deletions in 
this region prevent the incorporation of Erb] into pre-60S particles!" 
Nop 16 interconnects RNA elements of the 5.8S rRNA and regions of 
domain I. Additionally, Nop16 interacts with both Rpl8 and Rpl13 
(Fig. 3c). The DEAD-box helicase Has] is positioned at the interface of 
Rpl8, Cicl, Nop16 and Erbl (Fig. 3c). The assembly factors Cic1, Rlp7, 
Nop7 and Nop15 appear in both the nucleolar pre-60S particle and the 
Nog? particle in largely the same conformation (Fig. 3c, d). 

Notably, the N-terminal segment of Erb1 employs molecular 
mimicry by binding to Nop7 ina similar fashion as Nop53 in the Nog2 
particle, which uses a structurally related motif to bind to Nop7. The 
resulting steric hindrance is exacerbated by the alternate conforma- 
tion of the N terminus of Rlp7, which further prevents Nop53 binding 
(Fig. 3e, f). Therefore, the coordinated mechanical removal of Erb1 
and its proximal factors Ytm1, Nop16 and Has1 by Mdn1 is required 
before Nop53 can bind to the Nog? particle and recruit the exosome- 
associated RNA helicase Mtr4 for ITS2 processing’”!*. The Has] hel- 
icase may have acted upon its substrate at an earlier stage during the 
27SA3-to-27SB transition. Alternatively, it may remodel flexible RNA 
elements in its vicinity for the ensuing 27SB processing". 

The nucleolar pre-60S states 2 and 3 represent distinct assembly 
intermediates of the polypeptide exit tunnel (Fig. 4). Ssfl, Rrp15 and 
Rrp14 are ordered in state 2, where they chaperone domains I and 


Front view 


b, Corresponding near-atomic model of state 2 with ribosome assembly 
factors and 25S rRNA domains labelled and colour-coded. Ribosomal 
proteins are shown in grey. 


5 APRIL 2018 | VOL 556 | NATURE | 127 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


Figure 2 | A ring of nucleolar assembly factors prevents premature 
folding of the 25S rRNA. a, b, Assembly factors chaperone areas 

of domains I, II and VI and interact with ribosomal proteins. 

c, d, Assembly factors prevent the formation of the polypeptide exit 
tunnel (PET). ¢, In state 2, Ssfl-Rrp15 blocks the binding of Rpl31. Nog1 
is largely unstructured. d, In the Nog2 particle (PDB 3JCT), the PET is 
formed, probed by Nog] and supported by Rpl131. e, f, Brxl-Ebp2 and 
an associated peptide (Brx1-AP) remodel domain I (helix 22) to prevent 
binding of domain V (helix 88) and domain IV (helix 68) (e); 

these interactions are present in the Nog? particle (f). 


VI, which line two sides of the forming PET (Fig. 2b, c). By contrast, 
domain VI, Ssfl, Rrp15 and Rrp14 are disordered in state 3. Here, Ytm1 
and Erb] chaperone domain II, which adopts a mature conformation 
with respect to domain I to form a different intermediate of the PET 
(Fig. 3a, b). A subsequent maturation step of states 2 and 3 is likely to 
involve the joining of domains III and VI and the formation of the PET 


State 2 


State 3 


Figure 4 | Model of early nucleolar stages of large subunit assembly. 
Domains of 25S rRNA are represented as separate segments that are 
flexible (in grey) or stably incorporated (in colour). State 1 represents 
an intermediate in which domains I and II are ordered and subsequent 
folding of either domain III or VI will occur (state 3 and state 2, 
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State 2 b 


Front view Back view 


Rrp14 
State 3 


CTD Ytm1 


Figure 3 | Molecular mimicry by Erb1 prevents premature ITS2 
processing. a, Cryo-EM maps of states 2 and 3 of the nucleolar pre-60S 
particle. State 2 contains domain VI but limited density for domain IH, 
while state 3 contains domain III with a flexible domain VI. NTD, 
N-terminal domain. b, Two views of the state 3 cryo-EM map. 

c, d, Cartoon representation of the ITS2 region in state 3 of the nucleolar 
pre-60S particle (c) and the Nog? particle (PDB 3JCT) (d). e, f, The 

N termini of Erb1 and Rlp7 in the nucleolar pre-60S particle prevent the 
binding of Nop53 (e), which binds to Nop7 in the Nog2 particle (f). 


on the solvent-exposed side. This may be accompanied by the insertion 
of the Nog] N terminus into the nascent PET and the replacement of 
Ssfl-Rrp15 by Rpl31 (Fig. 2c, d, 4). 

Conformational changes, first by the initially flexible domain V-5S 
RNP together with the Nog] GTPase domain, and subsequently by 
domain IV, would result in the overall conformation observed in the 
late-nucleolar Nog? particle” (Fig. 2c, d, 4). The base-pairing between 


Nog2 particle 


Intermediate states 


respectively) to partially form the polypeptide exit tunnel (PET). To 
mature into the Nog2 particle, the formation of the nascent PET must 
occur by Mdn1- and Rix7-dependent exchange and removal of assembly 
factors alongside stepwise folding of domain V and then domain IV. 
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METHODS 


Purification of nucleolar pre-60S particles. Nucleolar pre-60S particles were 
purified from a Saccharomyces cerevisiae BY 4741 strain containing a TEV 
protease-cleavable C-terminal GFP tag on Nsal (Nsal-40aaLinker-TEV-GFP) 
and a C-terminal 5x beta-catenin 3C protease-cleavable tag on Nop2 (Nop2- 
40aaLinker-3C-Bc5) for endogenous expression. Cultures were grown in full 
synthetic drop-out (SD) medium containing 2% raffinose (w/v) at 30°C to an 
optical density of 0.8-1, before addition of 2% galactose (w/v) for 16h, reaching 
saturation (optical density 5-6). Cells were then harvested by centrifugation at 
3,000g for 10 min at 4°C. The cell pellet was washed with ice-cold ddH,O twice, 
followed by a wash with ddH,O containing protease inhibitors (E64, pepstatin and 
PMSF). Washed cells were immediately flash frozen in liquid nitrogen and lysed 
by four cycles of cryogenic grinding using a Retsch Planetary Ball Mill PM100. 

The freshly ground yeast powder was resuspended by vortexing in buffer A 
(50mM Tris-HCl, pH 7.6 (20°C), 150mM NaCl, 1mM EDTA, 1mM DTT, 0.1% 
Triton-X100, PMSF, pepstatin, E-64). The insoluble fraction was removed by cen- 
trifugation at 4°C, 40,000g for 30 min. The supernatant was subsequently incu- 
bated with anti-GFP nanobody beads (Chromotek) for 3h at 4°C, with agitation. 
The beads were washed four times in ice-cold buffer A before the bound proteins 
were eluted using TEV-protease cleavage (1h, 4°C). The eluate was then incubated 
with NHS-sepharose beads (Sigma) coupled with anti-beta-catenin nanobody”” 
in buffer B (50mM Tris-HCl pH 7.6 (20°C), 150 mM NaCl, 1mM EDTA, 1mM 
DTT) for 1h at 4°C with agitation. For electron microscopy sample preparation, 
the anti-beta catenin beads were washed once with buffer B. Cleavage by 3C pro- 
tease for 1h at 4°C released the Nsal-Nop2 containing nucleolar pre-60S parti- 
cles. For protein-protein cross-linking analysis the eluate from GFP-nanobody 
beads was incubated with beta catenin-nanobody beads in buffer C (50 mM 
HEPES-NaOH pH 7.6 (4°C), 150mM NaCl, 1 mM EDTA) and eluted in the same 
buffer by 3C-protease cleavage. The eluate typically measured an absorbance at 
260 nm (A260) of 2.4-4.5 milli absorbance units (mAU) (Nanodrop 2000, Thermo 
Scientific) (Extended Data Fig. 1). 

Cryo-EM sample and grid preparation. Cryo-EM grids were prepared on four 
different occasions for the four datasets obtained (ds1-ds4). The nucleolar pre-60S 
particle eluate in sample buffer B (above) was left as is (ds1 only), or supplemented 
with 0.1% Triton X-100 and 5mM MgCl (final concentration, ds2-ds4). Copper 
grids of 400 mesh with lacey carbon and an ultra-thin carbon support film were 
used (Ted Pella, product no. 01824) for data collection. A volume of 3-411 nucle- 
olar pre-60S particle sample (absorbance at 260 nm of 2.5 mAU) was applied onto 
glow-discharged grids and plunged into liquid ethane using a Vitrobot Mark IV 
robot (FEI Company) (100% humidity, blot force of 0 and blot time 3.5-4s). 
Cryo-EM data collection and image processing. A total of 14,201 micrographs 
were obtained over four data collections (ds1-ds4) on a Titan Krios (FEI Company) 
at 300kV with a K2 Summit detector (Gatan). SerialEM”° was employed for 
data acquisition using a defocus range of 1.0-3.5,.m with a pixel size of 1.3 A. 
Super-resolution movies with 32 frames were collected using a total dose 
of 10 electrons per pixel per second with an exposure time of 8s and a total 
dose of 47 electrons per A? (Table 1). 

Upon data collection, the movies were gain corrected, dose weighted and 
aligned with Motioncor2 (ref. 21), and the contrast transfer function (CTF) was 
estimated using CTFFIND 4.1.5 (ref. 22). Relion 2.1 (ref. 23) was used for all sub- 
sequent particle picking, classifications and refinements. Corrected and aligned 
micrographs were first subjected to autopicking in Relion, resulting in a total of 
1,653,290 selected particles from all four datasets. After manual inspection of all 
micrographs, particles were extracted with a box size of 480 pixels (2 x -binned 
to 240 pixels), and 2D-classified separately for each individual dataset. After 2D 
classification, bad classes were removed, and the selected particles of each dataset 
were 3D-classified into four classes using an initial 3D model obtained from cry- 
oSPARC”4, low-pass filtered to 60 A. The best one-to-two classes from each 3D 
classification were selected and their particles were re-extracted with a box size 
of 480 pixels (un-binned). A combined total of 514,746 particles was finally used 
for 3D auto-refinement and post-processing with a solvent mask around a ‘core’ 
containing domains I and II of the 25S rRNA, resulting in an overall resolution of 
3.4A (Extended Data Figs 2, 3a). Since the more flexible domains III-VI were not 
visible in this state owing to averaging of different populations, a subsequent round 
of 3D classification without image alignment of these particles into six classes was 
used to obtain the two classes that contained state 2 (39%) and states 1 and 3 (41%). 
A refinement of the class containing state 2 (201,114 particles) was performed 
using a mask to include the additional visible densities, comprising of domain VI 
and its associated proteins, to obtain a final map at 3.7 A (Extended Data Figs 2, 
3b). By performing an additional round of focused 3D classification without image 
alignment on this class with a mask around Mak11 and the neighbouring segment 
of domain V of the 25S rRNA, a subset of particles (36%) emerged with improved 


density. This was refined to provide a more continuous map of Mak11 and domain 
V (state 2A) (Extended Data Figs 2, 3c). The remaining 64% of particles did not 
reveal additional features beyond those seen in state 2. 

Owing to particle heterogeneity in the initial class containing states 1 and 3 
(41%, 211,534 particles), a focused classification without image alignment of that 
class of particles was performed using a mask around the additional density con- 
taining domain III of the 25S rRNA, Erb1-CTD (WD40) and Ytm1 (WD40). The 
class from this round of classification containing density for domain III (31,419 
particles) was refined with a mask around the entirety of state 3, to obtain a final 
map at 4.6 A resolution (Extended Data Figs 2, 3d). The highest-resolution class 
of particles from this classification, which contained no density for domain III 
(126,824 particles), was refined to obtain the 4.3 A map of state 1. The map 
of state 1 closely resembles the higher-resolution core map (correlation 
coefficient = 0.96). The local resolutions of the maps were calculated using 
Resmap”° (Extended Data Fig. 3). 

Model building and refinement. By using the structure of the late nucleolar pre- 
60S particle—the Nog? particle (PDB 3JCT)?—as reference, common assembly 
factors and ribosomal proteins were manually located and fitted into the density, 
using Cicl and Rpl7 as hallmark anchors. Protein identification and tracing were 
aided by cross-linking and mass spectrometry analyses (described below). New 
assembly factors Mak16, Rrp1, Nop16, Erb1-NTD, Rrp14, Rrp15 and Ebp2 and 
segments of rRNA were modelled de novo. Previously determined crystal structures 
of assembly factors Nsal (PDB 5SUI)”®, Ytm1 (PDB 5CXB) and Erb1-CTD (PDB 
4U7A)*’ were docked and manually adjusted. Has1, Ssf1, Brx1, Rpfl and Mak11 
were initially docked from Phyre2 models”’, and manually built to fit the density. 
Model building was performed with Coot”. An annotated list of individual protein 
IDs, reference models and corresponding maps used for building can be found in 
Extended Data Table 1. The model was refined against a half-map1 from the overall 
3.7 A map of state 2 in PHENIX with phenix.real_space_refine using secondary 
structure restraints for proteins and RNAs”’. Refinement and model statistics can 
be found in Table 1. 

Map and model visualization. All map and model analyses and illustrations 
were made using Chimera*! and PyMOL Molecular Graphics System, v.1.8 
(Schrédinger). Density map visualization for certain figures was also performed 
on UCSF ChimeraX, developed by the Resource for Biocomputing, Visualization, 
and Informatics and the University of California, San Francisco (supported by 
NIGMS P41-GM103311). 

RNA extraction and northern blotting. The S. cerevisiae nucleolar pre-60S par- 
ticle (Nsal-Nop2 particle) was purified as described in ‘Purification of nucleolar 
pre-60S particles’ and RNA was extracted from the final 3C-protease elution with 
1 ml TRIzol (Life Technologies) according to the manufacturer’s instructions. 
Isolated nucleolar pre-60S RNA (1.0}1g) was separated on a denaturing 1.2% 
formaldehyde-agarose gel (SeaKem LE, Lonza) or ona denaturing 10% urea~PAGE 
(Fisher, Amresco) for the 5S rRNA northern blot. After staining the gel in 1 x SYBR 
Green II (Lonza) ddH0 solution (pH 7.5) for 30 min, RNA species were visualized 
with a Gel Doc EZ Imager (Bio-Rad) (Extended Data Fig. 1d, e) and then trans- 
ferred onto a cationized nylon membrane (Zeta-Probe GT, Bio-Rad) using down- 
ward capillary transfer for the agarose gel and a Trans-Blot SD semi-dry transfer 
cell (Bio-Rad) for the urea~PAGE gel. RNA was cross-linked to the membrane 
for northern blot analysis by UV irradiation at 254 nm with a total exposure 
of 120 mJ/cm? in a UV Stratalinker 2400 (Stratagene). Cross-linked mem- 
branes were incubated with hybridization buffer (750mM NaCl, 75mM 
trisodium citrate, 1% (w/v) SDS, 10% (w/v) dextran sulphate, 25% (v/v) 
formamide) at 65°C for 30 min before addition of 7-*?P-end-labelled DNA 
oligonucleotide probes. Oligonucleotide probe sequences were as follows: 
258, TTTCACTCTCTTTTCAAAGTTCTTTTCATCT; ITS1 3’ end, TT 
AATATTTTAAAATTTCCAG; ITS2 C2 site, TGGTAAAACCTAAAACGACCGT; 
3/-ETS 5’ end, CCACTTAGAAAGAAATAAAAA; and 58, CTACTCG 
GTCAGGCTC. 

Probes were hybridized for 1 h at 65°C and then overnight at 37°C. Membranes 

were washed once with wash buffer 1 (300 mM NaCl, 30mM trisodium citrate, 1% 
(w/v) SDS) and once with wash buffer 2 (30 mM NaCl, 3 mM trisodium citrate, 1% 
(w/v) SDS) for 20 min each at 45°C. Radioactive signal was detected by exposure 
of the washed membranes to a storage phosphor screen which was scanned with a 
Typhoon 9400 variable-mode imager (GE Healthcare). For northern blot source 
data, see Supplementary Fig. 1. 
DSS cross-linking sample preparation and mass spectrometry analysis. The 
tandem-affinity purified nucleolar pre-60S particles (Nsal-Nog2 particle), eluted 
off anti-beta catenin nanobody beads (in 50 mM HEPES-NaOH pH 7.6 (4°C), 
150mM NaCl, 1mM EDTA, 1mM DTT) at an absorbance of 1.0 at 260nm 
(Nanodrop 2000, Thermo Scientific), were pooled (total volume, 300 11) and split 
into three 100-1 cross-linking reaction aliquots. 
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Disuccinimidylsuberate (DSS; 25 mM in DMSO, Creative Molecules) was added 
to each aliquot to yield a final DSS concentration of 2.0mM and samples were 
cross-linked for 30 min at 25°C with 450 r.p.m. constant mixing. The reactions 
were quenched with 50 mM ammonium bicarbonate (final concentration) and 
precipitated by adding methanol (Alfa Aesar, LC-MS grade) to a final concentra- 
tion of 90% followed by incubation at —80°C overnight. Precipitated cross-linked 
nucleolar pre-60S particles were combined into one tube by repeated centrifugation 
at 21,000g, 4°C for 30 min. The resulting pellet was washed three times with 1 ml 
ice-cold 90% methanol, air-dried and finally resuspended in 5011 1x NuPAGE 
LDS buffer (Thermo Fisher Scientific). 

DSS cross-linked samples were processed as in ref. 18, and as described 

below. DSS cross-linked nucleolar pre-60S particles in LDS buffer were reduced 
with 25mM DTT, alkylated with 100 mM 2-chloroacetamide, separated by 
SDS-PAGE in three lanes of a 3-8% tris-acetate gel (NUPAGE, Thermo Fisher 
Scientific), and stained with Coomassie blue. The gel region corresponding 
to cross-linked complexes was sliced and digested overnight with trypsin to 
generate cross-linked peptides. After digestion, the peptide mixture was 
acidified and extracted from the gel as previously described*”**. Peptides were 
fractionated offline by high-pH reverse-phase chromatography, loaded onto 
an EASY-Spray column (Thermo Fisher Scientific ES800: 15cm x 751m ID, 
PepMap C18, 341m) via an EASY-nLC 1000, and gradient-eluted for online 
electrospray ionization—-mass spectrometry (ESI-MS) and tandem mass 
spectrometry (MS/MS) analyses with a Q Exactive Plus mass spectrometer 
(Thermo Fisher Scientific). MS/MS analyses of the top 8 precursors in each full 
scan used the following parameters: resolution, 17,500 (at 200 Th); AGC target, 
2 x 10°; maximum injection time, 800 ms; isolation width, 1.4 m/z; normalized 
collision energy, 24%; charge, 3-7; intensity threshold, 2.5 x 10°; peptide match, 
off; dynamic exclusion tolerance, 1,500 mmu. Cross-linked peptides were identified 
from mass spectra by pLink*“. All spectra reported here were manually verified 
as described previously”. 
Data availability. Cryo-EM density maps for the yeast nucleolar pre-60S particle 
states 2 and 3 have been deposited in the EM Data Bank with accession codes 
EMD-7324 and EMD-7445, respectively. Atomic coordinates for the yeast nucle- 
olar pre-60S particle states 2 and 3 have been deposited in the Protein Data Bank 
under accession codes 6COF and 6CB1, respectively. A PyMOL session for the anal- 
ysis of the yeast nucleolar pre-60S particle in state 2 is available in Supplementary 
Dataset 2. 
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Extended Data Figure 1 | Purification of tagged Nsal-Nop2 
nucleolar pre-60S particles and analysis of RNA components. 

a, Schematic of tandem-affinity purification of the nucleolar pre-60S 
particle with tagged proteins Nsal and Nop2. b, Coomassie-blue stained 
SDS-PAGE of pre-60S particles purified as in a. Protein labels are based 
on in-solution mass spectrometry analysis of purified pre-60S particles 
and the approximate molecular weight. c, Schematic processing of the 
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large ribosomal subunit rRNAs in yeast. The locations of the previously 
published pre-60S particles (the Nog2 particles’, the Arx1 particle*’, the 
Rixl-Real particle’ and the Nmd3 particle*) are represented by blue bars. 
Binding sites of northern blot probes are indicated on the 35S and pre-5S 
transcript. d, e, Pre-rRNA was visualized on an agarose gel and stained 
using SYBR Green II. Northern blot analysis was performed for the 25S, 
ITS2, ITS1 and 3’ ETS RNAs (d) and for the 5S rRNA (e). 


d, part of Springer Nature. All rights reserved. 


LETTER 


14,201 micrographs 
1,653,290 particles 


Dataset 1 Dataset 2 Dataset 3 Dataset 4 
2,835 micrographs 3,403 micrographs 3,492 micrographs 4,471 micrographs 
344,291 particles 360,233 particles 494,948 particles 453,818 particles 


Initial 2D/3D classification 


92,948 particles 173,915 particles 152,267 particles 95,616 particles 


3D refinement 


core 
EMD-7324, additional map 1 
514,746 particles 
3.4A 
8D classification 
w/o image alignment 


6% 


Masked 3D refinement | 


3D classification 
w/o image alignment 


15% 17% 57% 11% 
Improved density Masked 3D refinement 3D refinement 
for domain V1 
~ state2 ~ 
EMD-7324 
201,114 particles 
3.7A 
3D classification Improved density 
w/o image alignment for domain III ' 
| | ! | “ 
tate 3 state 1 
9 209 20% 24° = 
36% Or ie ° EMD-7445 126,824 particles 
Masked 3D refinement erie) Pemicles 43A 
46A 
. Improved density 
ae : for Mak11/domain V 
state 2A 
EMD-7324, additional map 2 
75,512 particles 
4.2A 
Extended Data Figure 2 | Cryo-EM data-processing workflow. Four of 514,746 ‘good’ particles. These particles were refined to produce the 
data collections were performed, resulting in 14,201 micrographs. These core map. Further 3D classification without and with alignment was used 


micrographs were aligned using MotionCor2 (ref. 21) with dose weighting, to obtain the state 1, state 2, state 2A and state 3 maps. Density regions 
and imported into Relion2.1 (ref. 23) for further processing. Autopicking corresponding to domain III (red), domain VI (blue) and Mak11 (green) 
followed by manual cleaning, 2D and 3D classification produced a total are highlighted. 
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Extended Data Figure 3 | Overall and local resolution estimates for line), phase-randomized (solid grey line), masked (dashed grey line) 
core, state 2 and state 3 cryo-EM maps. a-d, Overall and local resolution and the corrected map (solid black line) are also shown. A thin black 
of core map at 3.4 A (a), state 2 map at 3.7 A (b), state 2A map with line indicates an FSC value of 0.143. A clipped view is shown next to two 
additional density for Mak11 at 4.2 A (c) and state 3 map at 4.6 A (d). views of the obtained cryo-EM maps. The density volumes are coloured 
Fourier Shell Correlation (FSC) curves for the unmasked (dashed black according to local resolution using Resmap”’. 
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Extended Data Figure 4 | Cryo-EM density fit of selected nucleolar of the 25S rRNA and 5.8S rRNA models and their cryo-EM density. Images 
pre-60S ribosomal subunit proteins and RNA models. a, Near-atomic generated in PyYMOL or Chimera. 
models of assembly factors and their cryo-EM density. b, Selected regions 
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Extended Data Figure 5 | rRNA domains of state 2, state 3 and the domains of the 25S rRNA are colour-coded and displayed in the crown 
mature 60S ribosomal subunit. The 5.8S rRNA, the 5S rRNA and the and back view for each structure. 
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Extended Data Figure 6 | Ribosomal proteins associated with between the 5.8S-domain I duplex and a domain I-domain II interface. 
Diamond-Blackfan anaemia are positioned at rRNA domain junctions c, Rpl33 (Rp135 in Homo sapiens) binds at the junction of domains I, II 

in the nucleolar pre-60S particle. a, Two views of the nucleolar pre- and VI in the 25S rRNA. d, Rp126 associates with the domain I-5.8S rRNA 
60S particle state 2 model, with Diamond-Blackfan anaemia-associated interface and additionally inserts its N terminus (N) between domain I 
ribosomal proteins shown in surface representation. b, Rpl15 is located and domain II. 
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Extended Data Figure 7 | Intermediates of nucleolar pre-60S assembly. 
The structural data presented in this paper (states 1, 2, and 3) are 
complemented by recent data on pre-60S assembly: states C (EMD-3893) 
and E (EMD-3891)!°. State 1 is highly similar to state A (EMD-3888)!°. 
States 2 and 2A correspond closely to state B (EMD-3889)'°, but state B 


unassigned 
IV helices 
Spb4 


ES27 


lacks the Ssfl-Rrp15-Rrp14 module and Mak11. Built assembly factors 
that become ordered or leave in subsequent particles are indicated 

with arrows. Two possible pathways are shown that result in the final 
incorporation of the DEAD-box helicase Spb4 (previously unidentified’). 


bh 
~ 180° 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


state 2 Kater et al., state E 


a, 


Rpli3 Has1-NTD 


Kater et al., state E 


Ae 


Nog2 particle 


2 


oe 


Extended Data Figure 8 | Steric hindrance during nucleolar pre-60S incorporation of Rp128. c, d, Comparative views of state E (c) and the Nog2 
assembly. a, b, Comparative views of state 2 (a) and state E(PDB6ELZ)'°(b), _ particle (PDB 3JCT) (d), highlighting that the presence of Spb1 prevents 
highlighting that the binding of the Brxl CTD to Rrp1 prevents premature __ the binding of Bud20. 
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Extended Data Table 1 | Molecular models of the nucleolar pre-60S ribosomal subunit 


Total 
Subgroup Chain ID| SegiD Molecule name residues or Modelled (residue range) Initial PDB template Present in... 
bases 


atomic (1,676 bases 
atomic (158 basi 


c (87 bas 


Rpl4A_uL4 atomic (2-56, 89-347) 3JCT All states 


Rpl6A_eL6 atomic (7-176) 3JCT All states 
Rpl32_eL32 atomic (7-36, 47-130) 3JCT All states 
Rp!l7A_uL30 atomic (3-244) 3JCT All states 
Rpi33A_eL33 atomic (2-107) 3JCT All states 
Rpi8A_eL8 atomic (53-239) 3JCT All states 
Rpl35A_uL29 atomic (2-120) 3JCT All states 
Rpl36A_eL36 atomic (17-100) 3JCT All states 
Rpli3A_eL13 atomic (22-127) 3JCT All states 
Rpl14A_eL14 atomic (11-138) 3JCT All states 
Rpli5A_eL15 atomic (2-68, 96-204) 3JCT All states 
hiboeomallproteine Rpli6A_uL13 atomic (3-59, 73-199) 3JCT All states 
Rpli8A_eL18 atomic (15-146) 4v88, chain BQ All states 
Rpl20A_eL20 atomic (2-172) 3JCT All states 
Rpl3_uL3 atomic (17-224, 270-385) 4v88, chain DB State 2 
Rpli7A_uL22 atomic (10-64, 80-126, 140-161) 3JCT All states 
Rpl23A_uL14 atomic (16-137) 3JCT State 2 
Rpl26A_uL24 atomic (2-127) 3JCT All states 
Rpl37A_eL37 atomic (14-85, Zn) 3JUCT All states 
Rpl27A_eL27 side-chain trimmed (2-136) 3JCT State 3 
Rpi38_eL38 side-chain trimmed (2-78) 3JCT State 3 
Rpl34A_eL34 side-chain trimmed (2-102) 3JCT State 3 
Rpl30_eL30 side-chain trimmed (9-105) 3JCT State 3 
Rpl25_uL23 side-chain trimmed (2-142) 3JCT State 3 
Cic1 atomic (31-51, 64-302) 3JCT All states 
Nop7 atomic (13-43, 61-267, 351-396, 404-460) 3JCT All states 
Nop15 atomic (88-220) 3JCT All states 
See ene Rlp7 atomic (54-105, 127-322) 3JCT All states 
particle (CT) Rip7 (NTD) poly-alanine (20-53) De novo State 3 
Rip24 atomic (2-130, Zn) 3JCT State 2 
Tif6 atomic (1-226) 3JCT State 2 
Nog1 atomic (373-470) 3JCT State 2 
Nsa1 atomic (1-78, 101-416) 5SUI All states 
Has1 atomic (42-252, 264-489) Phyre model based on 2V1X All states 
Brxt sgl Hehe SBE an -230), Model based on SWLC, chainSM _Allstates 
Ebp2 poly-alanine (196-269) De novo All states 
Rrp1 atomic (1-186, 197-253) De novo All states 
Mak16 atomic (2-191, Zn) De novo All states 
Rpf1 atomic (8-295) Phyre model based on 5JPQ, chain c All states 
New assembly 7 
facia Erb1-NTD atomic (239-298, 372-395), poly-alanine (299-371) De novo All states 


side-chain trimmed crystal structure 
(416-426, 428-534, 571-807) ate SHOE 
Ssf1 atomic (23-214, 324-356) Phyre model based on 4XV9 State 2 
Mak11 poly-alanine (WD40, 285 residues) Phyre model based on 3DMO States 2 and 2A) 
Rrp15 atomic (174-243) De novo State 2 
Ytm1 poly-alanine (C. thermophilum, 465 residues) 5CXB State 3 
Nopi6 atomic (1-83, 156-228) De novo All states 
Rrp14 atomic (296-393) De novo State 2 
Brx1-associated peptide unknown i De novo All states 


ndividual protein chains are listed with their initial PDB template (or built de novo) and the nucleolar pre-60S particle state(s) in which they are present. 
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Structural insights into the voltage and phospholipid 
activation of the mammalian TPC]1 channel 


Ji She!*, Jiangtao Guo**, Qingfeng Chen!?**, Weizhong Zeng!?*, Youxing Jiang!** & Xiao-chen Bai?* 


The organellar two-pore channel (TPC) functions as a homodimer, 
in which each subunit contains two homologous Shaker-like six- 
transmembrane (6-TM)-domain repeats!. TPCs belong to the 
voltage-gated ion channel superfamily” and are ubiquitously 
expressed in animals and plants**. Mammalian TPC1 and TPC2 are 
localized at the endolysosomal membrane, and have critical roles in 
regulating the physiological functions of these acidic organelles*’. 
Here we present electron cryo-microscopy structures of mouse 
TPC1 (MmTPC1)—a voltage-dependent, phosphatidylinositol 
3,5-bisphosphate (PtdIns(3,5)P)-activated Na*-selective channel— 
in both the apo closed state and ligand-bound open state. Combined 
with functional analysis, these structures provide comprehensive 
structural insights into the selectivity and gating mechanisms of 
mammalian TPC channels. The channel has a coin-slot-shaped ion 
pathway in the filter that defines the selectivity of mammalian TPCs. 
Only the voltage-sensing domain from the second 6-TM domain 
confers voltage dependence on MmTPC1. Endolysosome-specific 
PtdIns(3,5)P, binds to the first 6-TM domain and activates the 
channel under conditions of depolarizing membrane potential. 
Structural comparisons between the apo and PtdIns(3,5)P»- 
bound structures show the interplay between voltage and ligand 
in channel activation. These MmTPC1 structures reveal lipid 
binding and regulation in a 6-TM voltage-gated channel, which is 
of interest in light of the emerging recognition of the importance of 
phosphoinositide regulation of ion channels. 

TPC1 and TPC2 represent two major subfamilies of mammalian 
TPC channels and their functions are associated with various 
physiological processes, including hair pigmentation*!°, autophagy 
regulation!!!” blood vessel formation}, acrosome reaction in sperm', 
mTOR-dependent nutrient sensing!®, lipid metabolism'® and Ebola 
virus infection!’, to name a few. Mammalian TPCs were initially 
suggested to mediate nicotinic acid adenine dinucleotide phosphate 
(NAADP)-dependent calcium release from endolysosomes!*”°, 
However, several recent studies have demonstrated that mammalian 
TPCs are Nat-selective channels activated by endolysosome-specific 
PtdIns(3,5)P, rather than NAADP!*"!. The dual regulation of TPC2 by 
both PtdIns(3,5)P; and NAADP has also been reported”. Distinct from 
TPC2, mammalian TPC1 activation is voltage-dependent, conferring 
electrical excitability to the endolysosome”**. The atomic structure of 
a plant TPC1 from Arabidopsis thaliana (AtTPC1) was recently deter- 
mined by X-ray crystallography, revealing the overall architecture of 
the TPC family*>°. However, mammalian TPCs share low sequence 
identity with their plant counterparts (Extended Data Fig. 1) and 
exhibit different gating and selectivity properties. Here we present the 
structural and functional analysis of MmTPC1. 

When overexpressed in HEK293 cells, some MmTPC1 channels are 
trafficked to the plasma membrane, enabling us to directly measure 
channel activity by patching the plasma membrane (Extended Data Fig. 2 
and Methods). In brief, MmTPC1 activation requires both membrane 


depolarization and the PtdIns(3,5)P> ligand (Extended Data Fig. 2b, c). 
The voltage activation of MmTPC1 is modulated by endolysoso- 
mal luminal pH”, and a lower pH shifts voltage activation towards 
a more positive potential (Extended Data Fig. 2d, e). In our record- 
ings, MmTPCI1 exhibits higher selectivity for Nat than K* and Ca” 
(Extended Data Fig. 2f, g); this is different from plant TPC1, which is 
non-selective?”8, 

MmTPC1 structures were determined in the presence and absence of 
PtdIns(3,5)P, to a resolution of 3.2 and 3.4 A, respectively, using single 
particle electron cryo-microscopy (cryo-EM) (Fig. 1, Extended Data 
Figs 3, 4and Extended Data Table 1). The cryo-EM density maps of both 
structures are of sufficient quality for de novo model building of major 
parts of the protein (Extended Data Fig. 5). Here we use the higher- 
resolution PtdIns(3,5)P2-bound structure for the initial description 
of the overall structural features. Similar to AtTPC1, each MmTPC1 
subunit contains two homologous 6-TM domains (6-TMI and 6-TMII) 
and two subunits that assemble into a rectangle-shaped functional 
channel, which is equivalent to a tetrameric Shaker-like channel 
(Fig. la, b and Extended Data Fig. 6). Following the same nomenclature 
as other voltage-gated channels, we labelled the six transmembrane 
helices within each 6-TM domain as IS1-IS6 and IIS1-IIS6, respec- 
tively (Fig. 1c). The transmembrane region of MmTPC1 is domain- 
swapped; the $1-S4 voltage-sensing domain (VSD) from one 6-TM 
interacts with the S5-S6 pore domain from the neighbouring 6-TM 
(Fig. 1b). The pore domain of the second 6-TM contains a luminal loop 
between IIS5 and pore helix 1 (IIP 1) that forms an upright antenna-like 
8-hairpin; Asn600 and Asn612 on this luminal loop are glycosylated 
with visible density for the covalently linked N-acetylglucosamine 
moiety of the sugar” (Fig. 1b, e and Extended Data Fig. 5c). 

Multiple cytosolic components within each TPC1 subunit— 
including the N-terminal H1 helix, the linker between the two 6-TMs 
and the C-terminal post IIS6 region—assemble into a tightly packed 
cytosolic domain (Fig. 1d). Despite low sequence homology, the linker 
between the two 6-TMs adopts the EF-hand domain structure with 
two EF-hand motifs (EF-1 and EF-2), similar to plant TPC1, and the 
C-terminal portion of the exceptionally long IS6 serves as the E1 helix 
(Fig. 1d and Extended Data Fig. 6d). Ca** is unlikely to bind to the 
EF motifs of MmTPC1 as these motifs lack essential Ca**-chelating 
acidic residues (Extended Data Fig. 1). The N-terminal H1 helix is 
tightly packed with the EF-1 motif and becomes an integral part of 
the EF-hand domain (Fig. 1d). Compared with plant TPC1 and mam- 
malian TPC2, MmTPC1 has a much longer C-terminal region, which 
adopts a horseshoe-shaped structure with four a-helices and two 
68-strands and tightly wraps around the EF-hand domain (Fig. 1d and 
Extended Data Figs 1, 6d). 

The MmTPC1 ion conduction pore, which consists of $5, S6 and 
two pore helices, adopts a closed conformation in the apo structure 
and an open conformation in the PtdIns(3,5)P2-bound structure 
(Fig. 2a—d). In the apo structure, the four pore-lining S6 helices form 
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Figure 1 | Overall structure of MmTPC1. a, A 3D reconstruction of 
PtdIns(3,5)P2-bound (purple density) MmTPC1 dimer with each subunit 
in individual colour. b, Cartoon representation of MmTPC1 in the same 
orientations as the electron microscopy maps in a. N-acetylglucosamine 
(NAG) molecules and PtdIns(3,5)P2 (purple) are shown as sticks. 

c, Topology and domain arrangement of MmTPC1 subunit. d, Structure of 
the 6-TMI and the soluble domain, with individual elements coloured as 
inc. Inset, zoomed-in view of the cytosolic soluble domain. e, Structure of 
the 6-TMII. 


a bundle-crossing at the cytosolic side, with two layers of hydrophobic 
residues—the Leu317 and Phe321 from each IS6 helix, and the Val684 
and Leu688 from each IIS6 helix—that form the constriction points that 
prevent the passage of hydrated cations (Fig. 2b, c). In the PtdIns(3,5) 
P,-bound state, the S6 helices undergo outward movement along with 
rotational motion (Fig. 2d). Consequently, the constriction-forming 
residues dilate and rotate away from the central axis, resulting in a 
much wider opening at the intracellular gate (Fig. 2b-d). In addition, 
the side chains of four acidic residues (Asp322 from each IS6 and 
Glu689 from each IIS6) that point tangentially to the pore in the closed 
structure undergo inward rotation and face the ion conduction pathway 
in the open state (Fig. 2b-d), generating a ring of negative charges at the 
gate that could facilitate channel conductance. The molecular mecha- 
nism of PtdIns(3,5)P»-induced channel opening will be discussed later. 

The selectivity filter region remains identical in both structures. Two 
sets of filter residues, Thr280-Ala281—Asn282 (filter I) in 6-TMI and 
Val647-Asn648-Asn649 (filter II) in 6-TMII, enclose the central ion 
pathway with different dimensions (Fig. 2e, f). The residues of filter I 
line the pathway with predominantly main-chain backbone carbonyls 
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and have atom-to-atom cross distances of about 8A (Fig. 2e). The 
residues of filter II use side chains to generate a much narrower path- 
way, with two constriction points formed by the Asn648 and Asn649 
residues (Fig. 2e). Positioned at the centre of the filter and stabilized by 
hydrogen-bonding interactions with the filter I backbone carbonyls of 
Thr280 and Ala281, the Asn648 side chain forms the narrowest point 
along the filter pathway; it has a cross distance of about 3.7 A and has 
the central role in defining the Na* selectivity of MmTPC1 (Fig. 2e, f). 
Asn648Ala mutation results in a complete loss of Na* selectivity 
(Fig. 2g and Extended Data Fig. 7). The Asn649 residues are positioned 
at the luminal entrance of the channel at a wider distance from one 
another, and Asn649Ala mutation reduces but does not abolish Na* 
selectivity (Fig. 2g and Extended Data Fig. 7). With an elongated coin- 
slot-like ion pathway at the filter, Na* ions probably pass through the 
MmTPC1 filter in a partially hydrated form. The two Asn648 side 
chains are positioned to provide optimal coordination to stabilize the 
permeating Na’ ion, but are too close to permit the passage of K™. 
Thus, Asn648 forms a simple size sieve to exclude K™ or larger ions. 

The two VSDs (VSD1 and VSD2) have virtually the same structures 
as their respective counterparts between the apo and ligand-bound 
states and, therefore, the higher-resolution PtdIns(3,5)P2-bound struc- 
ture will be used in the discussion (Extended Data Fig. 8a, b). Figure 3a 
provides the numbering of the S4 gating charge residues (R1-R5) from 
TPCs and other canonical voltage-gated channels for comparison. 
Although VSD1 contains three arginine residues in IS4 (Arg200, 
Arg203 and Arg206, at positions R2, R3 and R4, respectively) (Fig. 3b), 
it lacks some key features of canonical voltage sensors (see Extended 
Data Fig. 8a legend) and does not contribute to voltage-dependent 
gating—similar to the VSD1 of plant TPC1”°—as shown by the fact 
that replacing these arginines individually with a neutral residue does 
not affect the voltage activation of MmTPC1 (Fig. 3c and Extended 
Data Fig. 8c). 

VSD2 contains only two S4 arginines (Arg540 at position R3 and 
Arg546 at position R5), preserves the key features of a canonical 
voltage sensor—including the 3)9-helix in IIS4 and the conserved 
gating-charge transfer centre (Fig. 3d)—and is responsible for the 
voltage gating of MmTPC1. Mutations of Arg540 and Arg546 have 
a profound but opposite effect on the voltage dependence of the 
channel. The Arg540Gln mutation stabilizes VSD2 in an activated 
state and yields a voltage-independent channel that has a linear 
current-voltage relationship between —100 and 50 mV and can 
be activated by PtdIns(3,5)P2 even at hyperpolarization (Fig. 3e). 
The Arg546Gln mutant, by contrast, can barely be activated by voltage 
even at high concentrations of PtdIns(3,5)P», as if the voltage sensor is 
trapped in the resting state (Extended Data Fig. 8d). 

The MmTPC1 VSD2 adopts an activated conformation with its final 
voltage-sensing arginine (Arg546) positioned in the gating-charge 
transfer centre formed by Tyr487 and Glu490 from IIS2 and Asp512 
from IIS3, and the other voltage-sensing residues (Arg540 and Gln543) 
facing the external, luminal side (Fig. 3d). The VSD2 of AtTPC1, with 
its $4 arginines residing at positions R3—RS5, is also responsible for the 
voltage-gating of the channel, and its structure is in the resting state”. 
We can, therefore, extrapolate the conformational change of MmTPC1 
VSD2 from the activated to the resting state by comparing its structure 
with that of AtTPC1 (Fig. 3f, g). Except for the S4 helix, the two struc- 
tures superimpose well. This suggests that upon hyperpolarization the 
IIS4 of MmTPC1 would slide down by about two helical turns with- 
out undergoing structural change in the rest of VSD2, and position 
its R3 arginine (Arg540) at the gating-charge transfer centre (Fig. 3g). 
Concurrent with the IIS4 sliding, the IIS4—-S5 linker would swing down- 
ward and move closer to IIS6. Notably, VSD2 is in the activated state 
in both the apo and PtdIns(3,5)P2-bound structures, indicating that 
the voltage sensor can be activated without opening the channel in 
MmTPCl. 

The bound PtdIns(3,5)P, can be unambiguously identified 
from the electron microscopy density map of the ligand-bound 
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Figure 2 | Ion conduction pore of MmTPC1. a, Ion conduction pore, 
comprising IS5-S6 (pore 1) and IIS5-S6 (pore 2). b, c, Side view of the 
bundle-crossing formed by IS6 (b) and IIS6 (c) in the apo closed (salmon) 
and PtdIns(3,5)P2-bound open (blue) states. Numbers are cross distances 
(in A) at the constriction points. d, Structural comparison of the cytosolic 
gate between the closed and open states viewed from the cytosolic side 

in three sections: Leu317/Val684 (left), Phe321/Leu688 (middle) and 
Asp322/G1u689 (right). e, Side view of the selectivity filter formed by 


structure (Extended Data Figs 5d, 9a). PtdIns(3,5)P» is situated at the 
junction formed by IS3, IS4 and the IS4-S5 linker of 6-TMI; its inositol 
1,3,5-trisphosphate head group is positioned on the cytosolic side and 
its acyl chains are inserted upright into the membrane (Fig. 4a and 
Extended Data Fig. 9a). Figure 4b summarizes the protein-ligand inter- 
actions, which involve predominantly basic residues from the C termi- 
nus of H1, the N terminus of IS3, the IS4—S5 linker and the C-terminal 
part of IS6. Buried deep in the protein, the two phosphate groups on 
the C1 and C3 positions of the inositol muster the majority of protein- 
ligand interactions and probably define the ligand specificity. The C5 
phosphate protrudes outwardly away from the ligand-binding pocket, 
and forms salt bridges with Lys87 and Lys331; the interaction with 
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Figure 3 | The voltage-sensing domains. a, Partial S4 sequence alignment 
and arginine registry. NavAb, Nav channel from Arcobacter butzleri. 

b, Side view of VSD1 with IS1 omitted for clarity. c, G/Gmax—V curves 

of wild-type MmTPC1 and IS4 arginine mutations. Sample traces are 
shown in Extended Data Fig. 8. All data points are mean + s.e.m. (n=5 
independent experiments). d, Side view of VSD2 with IIS1 omitted for 
clarity. e, Sample J-V curves of wild-type MmTPCl1 (obtained from the 
peak currents at various activation potentials) and Arg540GIn mutant 
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filter I and filter II with the front subunit removed for clarity. Numbers 
are cross distances (in A) at the constriction points. f, Top view of the 
selectivity filter. Inset, zoomed-in view of the filter with the stabilization 
H-bonds for Asn648 (dotted line) and electron microscopy density (grey) 
shown. g, Sample I-V curves of the filter mutations recorded with high 
Na‘ or K* in the bath solution. Pyax, permeability ratio of Na* to K*. 
Original traces are shown in Extended Data Fig. 7. The experiments were 
repeated five times independently with similar results. 


Lys331 participates in the coupling between the ligand and IIS6, and has 
an important role in the ligand activation of the channel. Among all the 
ligand-interacting residues at the PtdIns(3,5)P2-binding site, mutations 
of the residues that predominantly interact with the C3 phosphate— 
including the three arginines (Arg220, Arg221 and Arg224) on the 
1S4-S5 linker and Lys331 on IS6—appear to have the most profound 
effect on PtdIns(3,5)P> activation, which illustrates the central role of 
the C3 phosphate (Extended Data Fig. 9b). In a recent study, the three 
linker arginines have also been reported to be important for NAADP- 
mediated Ca”* release*”. 

To investigate the affinity and specificity of the ligand, we meas- 
ured the activity of the ligand-dependent channel in excised patches 


(obtained by applying a voltage pulse ramp from —100 to 100 mV). 
Currents were recorded with 2 1M PtdIns(3,5)P> in the pipette and 
repeated five times independently with similar results. f, Structural 
comparison of VSD2 between the PtdIns(3,5)P2-bound MmTPC1 
(orange) and AtTPC1 (cyan), with S1 helices omitted for clarity. 

g, Cartoon representation of VSD2 conformational change from the 
activated to resting state. Red arrows indicate the concurrent movements 
of $4 and $4-S5 linker. 
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Figure 4 | PtdIns(3,5)P2 binding in MmTPC1. a, PtdIns(3,5)P> binding in 
6-TMI of MmTPC1. Inset: zoomed-in view of the PtdIns(3,5)P 

site. b, Schematic of the protein-ligand interactions. c, Concentration- 
dependent PtdIns(3,5)P, activation of Arg540GIn mutant at —100 mV. 
Curve is least square fit to the Hill equation. Data points are mean + s.e.m. 
(n=5 independent experiments). Sample I-V curves are shown in 
Extended Data Fig. 9c. d, Ligand specificity of MmTPC1 measured 


by using the voltage-independent Arg540GIn mutant, which simplifies 
ligand-dependent gating by eliminating the voltage effect. The mutant 
also elicits much larger currents, which makes it suitable for inside-out 
patches. The PtdIns(3,5)P2-dependent activation of the mutant yielded 
a half-maximal effective concentration (EC59) of about 145nM (Fig. 4c 
and Extended Data Fig. 9c), similar to that of human TPC1 measured 
in whole lysosome patch”*. The PtdIns(4,5)P2 isoform cannot activate 
the channel or inhibit PtdIns(3,5)P, activation (Fig. 4d), indicating high 
lipid specificity of MmTPC1. The lack of PtdIns(4,5)P2-binding can 
be explained by the missing C3 phosphate and the close proximity of 
Asn85 and Lys87 to the C4 hydroxyl group, which sterically excludes 
the C4 phosphate and thereby prevents the binding of the lipid (Fig. 4e). 

Compared to the apo structure, PtdIns(3,5)P2-binding does not 
introduce major structural changes around the ligand-binding pocket 
(Extended Data Fig. 9d), except for one key conformational change 
on IS6 mediated by Lys331 (Fig. 4f). In the apo state, the Lys331 side 
chain points away from the ligand-binding pocket. In the presence 
of PtdIns(3,5)P2, the Lys331 side chain adopts an extended configu- 
ration to form salt bridges with both the C3 and C5 phosphates as 
well as a hydrogen bond with the C4 hydroxyl, pulling IS6 towards the 
ligand-binding pocket (Fig. 4f). This movement propagates to the other 
part of IS6, as well as IIS6, and opens the gate. Lys331 appears to be the 
only residue that couples IS6 to the bound PtdIns(3,5)P2 and its muta- 
tion to Ala completely abolishes PtdIns(3,5)P2 activation (Extended 
Data Fig. 9b). 

Our structures demonstrate that PtdIns(3,5)P2 only binds to the 
first 6-TM domain and directly introduces conformational changes 
in IS6 helix, whereas voltage influences only the VSD2 in the second 
6-TM domain, the conformational change of which is likely to affect 
the movement of IIS6 helix (Figs 3, 4). A global structural compari- 
son between the apo and PtdIns(3,5)P2-bound structures explains the 
interplay between the two stimuli (Fig. 5). Despite having an activated 
voltage sensor, the MmTPCI pore remains closed in the apo structure, 
implying that PtdIns(3,5)P»-binding is required to trigger the opening 
of the gate. Upon PtdIns(3,5)P2-binding, the ensuing tethering inter- 
action between Lys331 and PtdIns(3,5)P> straightens the IS6 helices 
that are initially bent at the n-helix just below the filter region in the 
closed state, resulting in the outward dilation and rotation at the bundle 
crossing (Figs 2d, 5a). The five-residue 1-helix is present only in IS6 
and may facilitate the helix bending. To open the pore, the IIS6 helices 
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using the Arg540GIn mutant. Sample J-V curves were recorded on the 
same patch with different PtdInsP isoforms. The experiments were 
repeated five times independently with similar results. e, Close proximity 
between the C4 hydroxyl of PtdIns(3,5)P> and the surrounding residues. 
f, Structural comparison at the region around Lys331 between the apo 
(green) and PtdIns(3,5)P:-bound MmTPC1 (salmon). 


also have to undergo concurrent outward and rotational movements 
to accommodate the PtdIns(3,5)P2-induced conformational change in 
1S6 helices, particularly the rotation of the two IS6 gating residues with 
large hydrophobic side chains (Leu317 and Phe321). Consequently, the 
two IIS6 gating residues (Val684 and L688) also rotate away from the 
central axis and open the gate (Figs 2d, 5b). The IIS6 motion is hinged 
around the residue immediately below the filter region, and is propa- 
gated to a much larger movement at the C-terminal end of IIS6, which 
swings upward and makes direct contact with the IIS4-S5 linker. Such 
motion is permitted only when IIS4 of VSD2 is in the activated, up state. 
Under hyperpolarized membrane potential, IIS4 is expected to slide 
downward and push the IIS4-IIS5 linker along with it, occluding the 
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Figure 5 | Gating mechanism of MmTPC1. a, b, Structural comparison 
between the apo closed (grey) and PtdIns(3,5)P2-bound open (green) 
MmTPC1 with zoomed-in views of the IS3-S6 (a) and IIS3-S6 (b) regions. 
Arrows indicate the S6 movements. Key gating residues are shown as 
sticks. IS6 contains a five-residue -helix (coloured red). c, Working model 
for voltage-dependent PtdIns(3,5)P2 activation of MmTPC1. Red arrows 
mark the direction of the driving force. 
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space necessary for upward IIS6 movement upon PtdIns(3,5)P. activa- 
tion (Fig. 3f, g). PtdIns(3,5)P2can probably still bind MmTPC1 under 
hyperpolarization, but the resting VSD2 prevents channel opening by 
blocking the movement of IIS6. Thus, membrane potential modulates 
the TPC] channel activity by imposing a voltage-dependent constraint 
on PtdIns(3,5)P2 activation and the upward movement of VSD2 under 
depolarization is a prerequisite for the PtdIns(3,5)P2-induced gate 
opening (Fig. 5c). 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized and investigators were not blinded to allocation during 
experiments and outcome assessment. 
Protein expression and purification. Mouse TPCl (MmTPC1, NCBI 
accession: NM_145853.2) containing a C-terminal thrombin cleavage site fol- 
lowed by a GFP tag and a 10x His tag was cloned into a pEZT-BM vector*! and 
heterologously expressed in HEK293F cells (Life Technologies) using the BacMam 
system (Thermo Fisher Scientific). The baculovirus was generated in Sf9 cells (Life 
Technologies) following the standard protocol and used to infect HEK293F cells 
at a ratio of 1:40 (virus: HEK293F, v/v) and supplemented with 10 mM sodium 
butyrate to boost protein expression. Cells were cultured in suspension at 37°C 
for 48h and collected by centrifugation at 3,000g. All purification procedures were 
carried out at 4°C. The cell pellet was re-suspended in buffer A (20 mM Tris, 
pH 8.0, 150mM NaCl) supplemented with a protease inhibitor cocktail (containing 
2\1g/ml DNase, 0.5 1g/ml pepstatin, 2\1g/ml leupeptin, and 1 :g/ml aprotinin and 
0.1mM PMSF) and homogenized by sonication on ice. MmTPC1 was extracted 
with 1% (w/v) n-dodecyl-3-p-maltopyranoside (Anatrace) supplemented with 
0.2% (w/v) cholesteryl hemisuccinate (Sigma Aldrich) by gentle agitation for 2h. 
After extraction, the supernatant was collected after a 60-min centrifugation at 
20,000g and incubated with Ni-NTA resin (Qiagen) using gentle agitation. After 
2h, the resin was collected on a disposable gravity column (Bio-Rad). The resin 
was washed with buffer B (20mM Tris, pH 8.0, 150mM NaCl and 0.06% glycol- 
diosgenin (Anatrace) supplemented with 20 mM imidazole. The washed resin 
was left on column in buffer B and digested with thrombin (Roche) overnight. 
After thrombin digestion, the flow-through containing untagged MmTPC1 
was collected, concentrated and purified by size exclusion chromatography on 
a Superdex 200 column (GE Heathcare) pre-equilibrated with buffer B. The peak 
fraction was pooled and concentrated to 4.7 mg/ml for cryo-EM analysis. To obtain 
PtdIns(3,5)P2_bound structure, the protein sample was supplemented with 0.5 mM 
PtdIns(3,5)P diC8 (Echelon Biosciences) for 30 min on ice before electron micros- 
copy grid preparation. 
Electron microscopy data acquisition. The cryo-EM grids were prepared by 
applying MmTPC1 (4.7 mg/ml, with or without 0.5 mM PtdIns(3,5)P2) to a 
glow-discharged Quantifoil R1.2/1.3 300-mesh gold holey carbon grid. Grids were 
blotted for 4.0s under 100% humidity at 4°C before being plunged into liquid 
ethane using a Mark IV Vitrobot (FEI). Micrographs were acquired on a Titan Krios 
microscope (FEI) operated at 300kV with a K2 Summit direct electron detector 
(Gatan), using a slit width of 20eV on a GIF-Quantum energy filter. Images were 
recorded with EPU software (FEI) in super-resolution counting mode with a 
super resolution pixel size of 0.535 A. The defocus range was set from —1.5 1m 
to —3m. Each micrograph was dose-fractionated to 30 frames under a dose rate 
of 4 e~ per pixel per s, with a total exposure time of 15, resulting in a total dose 
of about 50 e~ per A?. 
Image processing. Micrographs were motion corrected and binned twofold 
(yielding a pixel size of 1.07 A per pixel) with MotionCor2™. The CTF parameters 
of the micrographs were estimated using the GCTF program™. All other steps 
of image processing were performed using RELION vy.2.0**"». Initially, ~1,000 
particles were manually picked from a few micrographs. Class averages representing 
projections of MmTPC1 in different orientations were selected from the 2D clas- 
sification of the manually picked particles, and used as templates for automatic 
particle picking from the full dataset. For the Apo MmTPC1 dataset, 1,411,763 
particles were picked from 2,937 micrographs. The particles were extracted and 
binned 3 times (3.21 A per pixel). After 2D classification, a total of 1,117,348 
particles were finally selected for 3D classification using the AtTPC1 structure as 
the initial mode. Three of the 3D classes showed good secondary structure features 
and were selected and re-extracted into the original pixel size of 1.07 A. After 3D 
refinement with C2 symmetry imposed, and particle polishing, the resulting 3D 
reconstructions from ~536,000 particles showed a clear two-fold symmetry with 
a resolution of 3.5 A. We then performed a focused 3D classification with density 
subtraction to improve the density of transmembrane domain*. In this approach, 
only the density corresponding to the transmembrane domain was kept in each 
particle image, by subtracting the density for all other parts including the belt-like 
detergent density from the original particles. The subsequent 3D classification on 
the modified particles was carried out by applying a mask around the transmem- 
brane domain with all the particle orientations fixed at the value determined in 
the initial 3D refinement. After this round of classification, one class (~43,000 
particles) showed better density in the transmembrane domain. The corresponding 
particles before density subtraction from this class were selected and 3D-refined, 
yielding an electron microscopy map of 3.4 A for the entire channel. 

The data for MmTPC1 in the presence of PtdIns(3,5)P> were processed similarly 
to that of apo MmTPC1. In brief, 941,754 particles were picked from a total of 
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2,348 micrographs. After 2D classification, 620,307 particles were selected for 3D 
classification. Three classes with a total of ~245,000 particles were selected and 
combined for 3D auto-refinement, which resulted in a map with an overall reso- 
lution of 3.3 A. One round of 3D classification was then performed by focusing on 
the transmembrane domain. One class (~83,000 particles) showed better density in 
the transmembrane domain and was selected for final 3D refinement, yielding an 
electron microscopy map of 3.2 A. All resolutions were estimated by applying a soft 
mask around the protein density and the gold-standard Fourier shell correlation 
(FSC) = 0.143 criterion. ResMap was used to calculate the local resolution map*”. 
Model building, refinement and validation. De novo atomic model buildings 
were conducted in Coot**. Amino acid assignment was achieved based mainly 
on the clearly defined densities for bulky residues (Phe, Trp, Tyr and Arg). Real- 
space model refinement was performed in Phenix’. Models were validated using 
previously described methods, to avoid overfitting*“". The final structure models 
for both apo and PtdIns(3,5)P2-bound states include residues 66-701 and residues 
709-795. Residues 1-65, 702-708 and 796-817 are disordered and not modelled. 
The statistics of the geometries of the models were generated using MolProbity””. 
All the figures were prepared in PyMol” or Chimera“. Programs used for model 
building, refinement and validation are compiled by SBGrid®. 
Electrophysiology. In human TPC2, the Leul1Ala and Leu12Ala mutations at the 
N-terminal targeting sequence have previously been shown to promote channel 
expression and trafficking to the plasma membrane of the HEK293 cell, enabling 
channel activity measurement using patch clamp”“°. We therefore also intro- 
duced the equivalent mutations (Leul1Ala and Ile12Ala) to MmTPC1. HEK293 
cells overexpressed with the Leul1Ala/Ile12Ala mutant of MmTPC1 elicited 
much larger whole-cell currents than those expressed with wild-type MmTPC1 
(Extended Data Fig. 2a). Therefore, the Leul1Ala/Ile12Ala mutant was used and 
considered as the wild-type channel in all our recordings. All other mutations in 
our experiments were generated on the background of this plasma-membrane- 
targeting MmTPCl1. With the channels targeted to the plasma membrane, the 
extracellular side is equivalent to the luminal side of TPC1 in endosomes or 
lysosomes. MmTPC1 and its mutants were cloned into pCGFP-EU vector*”. About 
2 1g of the plasmid containing the C-terminal GFP-tagged MmTPC1 or its mutant 
was transfected into HEK293 cells grown in a six-well tissue culture dish using 
Lipofectamine 2000 (Life Technology). Forty-eight hours after transfection, cells 
were dissociated by trypsin treatment and kept in complete serum-containing 
medium and re-plated on 35-mm tissue culture dishes in a tissue culture incubator 
until recording. 

Patch clamp in whole-cell configuration was used to measure channel activity 
in most of the experiments except the measurements of ligand affinity and speci- 
ficity, which were recorded in excised patches (inside-out patches) using the 
voltage-independent Arg540GIn mutant. This mutant channel can be activated 
solely by PtdIns(3,5)P, and also yields much larger plasma membrane currents, 
which makes it more amenable for inside-out patches. The standard intracellular 
solution contained (in mM): 145 sodium methanesulfonate (Na-MS), 5 NaCl, 
4MgCh, 1 EGTA, 10 HEPES buffered with Tris, pH = 7.4. The extracellular solu- 
tion contained (inmM): 145 Na-MS, 5 NaCl, 1 MgCh, 1 CaCl, 10 HEPES buffered 
with Tris, pH =7.4. Various concentrations of PtdIns(3,5)P as specified in each 
experiment were added to the intracellular solutions to activate the channel. For 
patches in whole-cell configuration, the intracellular solution was in the pipette and 
the extracellular solution was in the bath; the solution arrangement was reversed 
for the inside-out patches. The lipid ligands used in our studies are phosphatidy- 
linositol-3,5-bisphosphate diC8 (PtdIns(3,5)P2 diC8, Echelon) and phosphatidy- 
linositol-4,5-bisphosphate diC8 (PtdIns(4,5)P diC8, Echelon). 

The data were acquired using an AxoPatch 200B amplifier (Molecular Devices) 
and a low-pass analogue filter set to 1 kHz. The current signal was sampled at a rate 
of 20kHz using a Digidata 1322A digitizer (Molecular Devices) and further ana- 
lysed with pClamp 9 software (Molecular Devices). Patch pipettes were pulled from 
borosilicate glass (Harvard Apparatus) and heat-polished to a resistance of 3-5 MQ). 
After the patch pipette attached to the cell membrane, a giga-seal (> 10GQ) 
was formed by gentle suction. The whole-cell configuration was formed by short 
zap or suction to rupture the patch. The inside-out configuration was formed by 
pulling the pipette away from the cell, and the pipette tip was exposed to the air 
for a short period in some cases. The holding potential was set to -70 mV. To 
generate G/Gmax versus V curves (G=I/V), the membrane was stepped from the 
holding potential (—70 mV) to various testing potentials (—100 mV to 100mV) 
for 1s and then stepped to —70 mV (Extended Data Fig. 2b). The peak tail currents 
were used to plot the G-V curve. Gmax was obtained from the peak tail current at 
100 mV testing potential. V1/2 and Z values were obtained from the fits of data 
with Boltzmann equation, in which Vj,2 is the voltage at which the channels have 
reached half of their maximum fraction open and Z is the apparent valence of 
voltage dependence. The same protocol was used to obtain current and voltage 
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relationships (I-V curve) of the wild-type MmTPCl (Fig. 3e, top trace), except that 
the peak current at each testing potential was used to generate the J-V curve. For 
voltage-independent Arg540GIn mutant, the holding potential was set to 0 mV, and 
the current and voltage relationship (I-V curve, Fig. 3e, bottom trace) was obtained 
directly by using voltage pulses ramp from —100 to 100 mV over 800-ms duration. 

For measuring ion selectivity of MmTPC1 and its mutants in whole-cell patches, 
10}1M PtdIns(3,5)P» was included in intracellular (pipette) solution to fully activate 
the channel. The membrane potential was stepped from the holding potential 
(—70 mV) to 100 mV for 1s to activate the channels, and then stepped to various 
testing potentials (—120mV to 4mV) for 1s (Extended Data Fig. 2f). The peak tail 
currents at various testing potentials were plotted to determine the reversal poten- 
tial (Viey). To measure the relative permeability between Na‘ and K*, the extracel- 
lular (bath) solution (in mM) was changed to 145 K-MS, 5 NaCl, 1 MgCh, 1 CaCh, 
10 HEPES buffered with Tris, pH 7.4. To measure the relative permeability between 
Na* and Ca’*, the extracellular solution (in mM) was changed to 95 Ca-(MS)>, 
5 CaCl, 10 HEPES buffered with Tris, pH 7.4. The ion permeability ratios were 
calculated with the equations: Pya/Px = [K]o/([Na]iexp(VievF/RT) - [Na].) and 
Pya/ Pca = 4[Ca]o/([Na]iexp( VievF/RT)(1 + exp(VrevF/RT))), in which Viey is the 
reverse potential, F is Faraday’s constant, R is the gas constant, T is the absolute 
temperature, o is extracellular and i is intracellular. 

All electrophysiological recording were repeated at least five times using 

different patches. Most data points shown are mean + s.e.m. (1 =5 independent 
experiments). 
Data availability. The cryo-EM density maps of the MmTPC1 have been deposited 
in the Electron Microscopy Data Bank under accession number EMD-7434 for 
the apo state, and accession number EMD-7435 for the PtdIns(3,5)P2-bound state. 
Atomic coordinates have been deposited in the RCSB Protein Data Bank under 
accession number 6C96 for the apo state, and accession number 6C9A for the 
PtdIns(3,5)P2-bound state. Source Data for Fig. 3c and Extended Data Fig. 2c, e 
are available in the online version of the paper. 
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Extended Data Figure 1 | Sequence alignment of MmTPC1, HsTPC1, 
AtTPC1, MmTPC2 and HsTPC2. Secondary structure assignments are 
based on the structure of PtdIns(3,5)P)-bound MmTPCl. Red dots mark 
the ligand-binding residues; black dots mark the S4 arginine residues 


and residues at the gating-charge transfer centre; cyan dots mark the key 
S6 gating residues; green dots mark the residues predicted to participate 
in Ca** coordination in EF-hand domains of AtTPC1. MmTPC1 and 
AtTPC1 share about 25% sequence identity. 
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Extended Data Figure 2 | See next page for caption. 
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Extended Data Figure 2 | Gating and selectivity of MmTPCl. 

a, Sample traces and current density (current/capacitance) of wild- 

type MmTPC1 and the L11A/112A mutant of MmTPC1, recorded in 

the whole-cell configuration with 100 1M PtdIns(3,5)P. in the pipette 
(cytosolic). The experiments were repeated five times independently 
with similar results. Data points for current density are mean +s.e.m. 
(n=5 independent experiments). The L11A/112A mutant elicited 

much larger whole-cell currents and was therefore used as the wild-type 
channel in all recordings. The extracellular side of MmTPC1 in plasma 
membrane is equivalent to the luminal side of MmTPC1 in lysosomes. 

b, Sample traces of PtdIns(3,5)P2-dependent voltage activation of 
MmTPCl. Whole-cell currents were recorded with varying PtdIns(3,5) 

P2 concentrations in the pipette (cytosolic) at pH 7.4. The experiments 
were repeated five times independently with similar results. c, G/Gmax-V 
curves of MmTPC1 at various PtdIns(3,5)P, concentrations. Boltzmann 
fit yields Vij. (mV) =21.6 + 1.2, 15.2+ 1.0, 16.140.9 and —2.041.0, 
and Z=0.78 + 0.04, 0.82 + 0.03, 0.89 + 0.02 and 0.84 + 0.05 for voltage 
activation in 0.05, 0.2, 2.0 and 101M cytosolic PtdIns(3,5)P2, respectively, 
in which Vj, is the membrane potential for half maximum activation and 


LETTER 


Z is apparent valence. All data points are mean + s.e.m. (n =5 independent 
experiments). d, Luminal pH modulates the voltage activation of 
MmfTPCl1. Whole-cell currents of MmTPC1 recorded in the presence of 
2M cytosolic PtdIns(3,5)P2 with a varying luminal (bath) pH of 7.4, 6.0 
or 4.6. Sample traces were obtained from the same patch. The experiments 
were repeated five times independently with similar results. e, G/Gmax-V 
curves of MmTPC1 at various luminal pH values. Boltzmann fit yields 

Vi =16.2£0.8mV, Z=0.91 + 0.02 at pH 7.4, Vj. =38.2+1.2mV, 
Z=0.95 + 0.02 at pH 6.0. All data points were normalized against Gmax 
obtained at 100 mV activation voltage and pH 7.4. All data points are 
mean + s.e.m. (n=5 independent experiments). f, Sample traces of whole- 
cell currents with 150 mM Na’ in the pipette solution, and either 150 mM 
Nat or 145mM K* and 5mM Na’ in the bath solution, and the I- V curves 
generated from the tail currents of the sample traces. g, Sample traces of 
whole-cell currents with 150mM Na‘ in the pipette solution and 150 mM 
Na* or 100mM Ca?" in the bath solution, and the J-V curves generated 
from the tail currents of the sample traces. Data in f and g were recorded 
with 101M PtdIns(3,5)P> in the pipette at pH 7.4 and both experiments 
were repeated five times independently with similar results. 
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Extended Data Figure 3 | Structure determination of PtdIns(3,5)P2- 
bound MmTPC1. a, Representative electron micrograph of PtdIns(3,5) 
P,-bound MmTPC1; 2,348 micrographs were used for structure 
determination. b, 2D class averages. c, Euler angle distribution of 
particles used in the final 3D reconstruction, with the heights of the 
cylinders corresponding to the number of particles. d, Final density maps 


3.2A 


coloured by local resolution. e, Gold-standard FSC curves of the final 3D 
reconstructions. f, FSC curves for cross-validation between the models 
and the maps. Curves for model versus summed map in black (sum), for 
model versus half map in blue (work) and for model versus half map not 
used for refinement in red (free). g, Flowchart of electron microscopy data 
processing for PtdIns(3,5)P2-bound MmTPC1 particles. 
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Extended Data Figure 4 | Structure determination of apo MmTPC1. 

a, Representative electron micrograph of apo MmTPC1; 2,937 
micrographs were used for structure determination. b, 2D class averages. 
c, Euler angle distribution of particles used in the final 3D reconstruction, 


with the heights of the cylinders corresponding to the number of particles. 


d, Final density maps coloured by local resolution. e, Gold-standard FSC 


3D classification 


3.4A 


curves of the final 3D reconstructions. f, FSC curves for cross-validation 
between the models and the maps. Curves for model versus summed map 
in black (sum), for model versus half map in blue (work) and for model 
versus half map not used for refinement in red (free). g, Flowchart of 
electron microscopy data processing for apo MmTPCI particles. 
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Extended Data Figure 5 | Sample electron microscopy density maps and sharpened with a temperature factor of —105 A”. e, f, Sample electron 
(blue mesh) for MmTPC1. a-d, Sample electron microscopy density microscopy density maps for the key parts of apo MmTPC1: ligand 
maps for various parts of PtdIns(3,5)P2-bound MmTPC1: IS1-IS6 and binding site (e) and S6 helices (f). The maps are low-pass filtered to 3.4A 
filter I (a), IIS1-IS6 and filter II (b), NAGs of Asn600 and Asn612 (c), and and sharpened with a temperature factor of —98.5 A?. Residues discussed 
PtdIns(3,5)P2-binding site (d). The maps are low-pass filtered to 3.2 A in main text are labelled in red. 
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Extended Data Figure 6 | Structure comparison between MmTPCl1 and — VSD 1 domains. The comparison of the VSD2 domains is shown in Fig. 3f. 
AtTPC1. a, Superposition of the overall structures of MmTPC1 (blue) and __ d, Superposition of cytosolic soluble domains. 
AtTPC1(salmon). b, Superposition of the pore regions. c, Superposition of 
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Asn648Ala and Asn649Ala filter mutants. The pipette solution contained _ shown in Fig. 2g. The experiments were repeated five times independently 
150 mM Na‘ and the bath solution contained 150mM Na’, or 145mM with similar results. 
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Extended Data Figure 8 | Voltage-sensing domains. a, Superimposition 
of MmTPC1 VSD1 structures in the PtdIns(3,5)P,-bound (green) and 
apo (pink) states with $1 helices removed for clarity. The MmTPC1 

VSD1 lacks some key features of canonical voltage sensors: the conserved 
aromatic residue on S2 and acidic residue on $3 that form the gating- 
charge transfer centre become Val152 and Lys177, respectively, in 
MmTPC1; the conserved basic residue at the R5 position becomes Phe209 
in MmTPC1; no arginine from IS4 is positioned in the gating-charge 
transfer centre. b, Superimposition of MmTPC1 VSD2 structures in the 
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PtdIns(3,5)P2-bound (orange) and apo (cyan) states. c, Sample traces of 
voltage activation of MmTPC1 and its IS4 arginine mutations, recorded in 
whole-cell configuration with 2 1M PtdIns(3,5)P» in the pipette. Peak tail 
currents were used to generate the G/Gmax- V curves shown in Fig. 3c. The 
experiments were repeated five times independently with similar results. 
d, Sample traces of voltage activation of Arg546GIn mutation, recorded 

in whole-cell configuration with 21M and 100,.M PtdIns(3,5)P, in the 
pipette. The experiments were repeated five times independently with 
similar results. 
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Extended Data Figure 9 | PtdIns(3,5)P2-binding in MmTPC1. a, Model 
of bound PtdIns(3,5)P> (left) and its electron microscopy density (right). 
Density of two other membrane lipid molecules (blue mesh in left panel) 
was also observed near PtdIns(3,5)P. in the structure. b, Current density 
of mutations at the PtdIns(3,5)P-binding site measured at —100 mV in 
whole-cell recordings. All mutants were generated on the background 

of Arg540GIn mutant, which was used as control. All data points are 
mean + s.e.m. with the number of independent experiments for each 


mutant shown in parentheses. c, Sample J-V curves of Arg540GIn mutant 
recorded in excised patches with varying concentrations of PtdIns(3,5) 

P, in the bath (cytosolic). The experiments were repeated five times 
independently with similar results. Currents at —100 mV were used to 
generate the concentration-dependent PtdIns(3,5)P, activation curve 
shown in Fig. 4c. Imax is the current recorded at —100 mV with 101.M 
PtdIns(3,5)P, in the bath. d, Structural comparison at the ligand-binding 
site between the PtdIns(3,5)P2-bound (green) and apo (salmon) states. 
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Extended Data Table 1 | Cryo-EM data collection and model statistics 


Ptdins(3,5)P2 - Apo 
bound MmTPC1 MmTPC1 
(EMD-7435) (EMD-7434) 
(PDB 6C9A) (PDB 6C96) 
Data collection and 
processing 
Magnification 46730 46730 
Voltage (kV) 300 300 
Electron exposure (e /A*) ~50 ~50 
Defocus range (um) -1.5 to -3.0 -1.5 to -3.0 
Pixel size (A) 1:07 1.07 
Symmetry imposed C2 C2 
Initial particle images (no.) 941,754 1,260,054 
Final particle images (no.) 82,819 42,870 
Map resolution (A) 3.2 3.4 
FSC threshold 0.143 0.143 
Refinement 
Initial model used (PDB code) 5E1J 5E1J 
Model resolution (A) 3.2 3.4 
FSC threshold ; 0.143 0.143 
Map sharpening B factor (A°) — -105.07 -98.52 
Model composition 
Non-hydrogen atoms 12182 12090 
Protein residues 12004 12004 
Ligands _ 178 86 
B factors (A*) 
Protein 80.93 98.57 
Ligand 67.59 81.03 
R.m.s. deviations 
Bond lengths (A) 0.009 0.010 
Bond angles (°) 1.286 1.309 
Validation 
MolProbity score 1.41 1.5 
Clashscore 2.79 3.5 
Poor rotamers (%) 0 0.3 
Ramachandran plot 
Favored (%) 94.99% 94.85% 
Allowed (%) 5.01% 5.15% 
Disallowed (%) ) 0 
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CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature25997 


Corrigendum: Commensal 
bacteria make GPCR ligands that 
mimic human signalling molecules 


Louis J. Cohen, Daria Esterhazy, Seong- Hwan Kim, 
Christophe Lemetre, Rhiannon R. Aguilar, Emma A. Gordon, 
Amanda J. Pickard, Justin R. Cross, Ana B. Emiliano, 

Sun M. Han, John Chu, Xavier Vila-Farres, Jeremy Kaplitt, 
Aneta Rogoz, Paula Y. Calle, Craig Hunter, J. Kipchirchir Bitok 
& Sean F. Brady 


Nature 549, 48-53 (2017); doi:10.1038/nature23874 


In this Article, the description in the Methods of the human stool 
samples used for the analysis presented in Extended Data Fig. 9 was 
incomplete. All samples were collected with informed consent under 
protocol numbers 09-067, 09-141 and 06-107 at Memorial Sloan 
Kettering Cancer Center (MSKCC) and stored as part of a biospecimen 
repository. All sample processing was done at MSKCC. All patients 
underwent a bone marrow transplant which was standard of care 
and part of an observational study (not a clinical trial, as originally 
stated in the Methods). Patient age, gender and transplant indication 
are provided in the table now added to Extended Data Fig. 9. The 
Reporting Summary has been updated (the original uncorrected 
Reporting Summary is provided as the Supplementary Information 
to this Corrigendum). The original Letter has been corrected online. 


Supplementary Information is available in the online version of this Corrigendum. 


5 APRIL 2018 | VOL 556 | NATURE | 135 
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CORRECTIONS & AMENDMENTS 


ERRATUM 
doi:10.1038/nature26162 


Erratum: Asparagine 
bioavailability governs metastasis 


in a model of breast cancer 


Simon R. V. Knott, Elvin Wagenblast, Showkhin Khan, 

Sun Y. Kim, Mar Soto, Michel Wagner, Marc- Olivier Turgeon, 
Lisa Fish, Nicolas Erard, Annika L. Gable, Ashley R. Maceli, 
Steffen Dickopf, Evangelia K. Papachristou, Clive S. D’Santos, 
Lisa A. Carey, John E. Wilkinson, J. Chuck Harrell, 

Charles M. Perou, Hani Goodarzi, George Poulogiannis & 
Gregory J. Hannon 


Nature 554, 378-381 (2018); doi:10.1038/nature25465 


In this Letter, several errors were inadvertently introduced during 
the production process. In Fig. 3d, the blue bars should be labelled 
‘L-asparaginase’ rather than ‘L-asparagine’ In the main text, “These 
were collected onto Matrigel...” should be “These were placed onto 
Matrigel...’, and in the Methods, ‘corresponding proteins’ should 
be ‘corresponding peptides’ in the text: “... proteins were removed 
from subsequent analysis if fewer than 5 corresponding peptides 
were identified in any sample. The legend to Extended Data Fig. 6b 
should state “Volumes of orthotopic.’ rather “Volumes of ortho- 
tropic... Finally, in the HTML the legend to Supplementary 
Table 4 was repeated for Supplementary Table 5. The legend to 
Supplementary Table 5 should have stated: ‘Amino acid composi- 
tion of serum with and without L-asparaginase treatment. 4T1-T 
cells harbouring the non-targeting Renilla shRNA were injected into 
immunocompromised mice. Five mice each were either injected 
with 60 U L-asparaginase or PBS 5 days per week. After blood 
collection and serum isolation, free amino acids were quantified using 
High Performance Liquid Chromatography (HPLC) and a fluorometric 
detector’ These errors have all been corrected online. 
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CORRECTIONS & AMENDMENTS 


ERRATUM 
doi:10.1038/nature26163 


Erratum: Evolutionary history 
resolves global organization of root 


functional traits 


Zeqing Ma, Dali Guo, Xingliang Xu, Mingzhen Lu, 
Richard D. Bardgett, David M. Eissenstat, M. Luke McCormack 
& Lars O. Hedin 


Nature 555, 94-97 (2018); doi:10.1038/nature25783 


In this Letter, owing to an error during the production process, only 
author L.O.H. was listed as a corresponding author, instead of both 
D.G. (guodl@igsnrr.ac.cn) and L.O.H. (lhedin@princeton.edu). This 
has been corrected online. 
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BRIEF COMMUNICATIONS ARISING 


Improving outcomes in congenital cataract 


ARISING FROM H. Lin et al. Nature 531, 323-328 (2016); doi:10.1038/nature17181 


Lens regeneration after cataract surgery in infants is a clinical 
phenomenon with which paediatric ophthalmologists battle. Lin et al. 
reported a novel surgical technique that aimed to convert this post- 
operative complication into an alternative therapy for children aged 
under 24 months. However, the early outcomes reported in the experi- 
mental group fall short of outcomes achievable through conventional 
treatment in this population. As the authors offer no comment or expla- 
nation for this discrepancy, this new approach cannot be considered 
effective or safe for affected children. There is a Reply to this Comment by 
Liu, Y. et al. Nature 556, https://dx.doi.org/10.1038/nature26150 (2018). 

Regeneration of residual lens cells following surgical removal of con- 
genital cataract can result in re-opacification, which needs to be treated by 
further intraocular surgery, necessitating repeated general anaesthetics 
during a sensitive period of neurodevelopment. The adaptation of this 
regenerative process by Lin et al.', in which a novel surgical method of 
cataract removal preserves endogenous lens cells, enabling functional 
lens regeneration, may eventually lead to the development of treatments 
for degenerative disease. However, issues relating to adult cataract (an 
age-related degenerative process) and congenital and infantile cataract 
are conflated in their report. Cataract is virtually universal in older age, 
making cataract surgery one of the most common surgical procedures, 
with enviably excellent visual outcomes. By contrast, infantile cataract is 
uncommon, affecting 3-15 per 10,000 children worldwide and, by defi- 
nition, present from birth or early infancy. We now understand that 
mutations within the genes responsible for the production or orchestra- 
tion of the lens epithelial progenitor/stem cells (LECs) are responsible 
for the majority of bilateral congenital or infantile cataract, even in 
cases where there is no family history’. Thus the treatment approach 
adapted by Lin et al.', which relies on regeneration of lens stem cells 
without addressing the persisting underlying genetic defect, cannot be 
definitive. Children treated using this technique may require further 
surgery but Lin et al. do not acknowledge this in the article. 

The rationale for the trial reported by Lin et al.' is the need to address 
adverse outcomes associated with the use of artificial intraocular lenses, 
which are implanted in some children to replace the focusing power 
of the removed cataractous lens*. However, surgery with intraocular 
lens implantation was not the ‘control’ standard approach evaluated 
within the report. Equivalence in vision outcomes between their inter- 
vention and control groups was reported, but these should also have 
been assessed against the extant benchmark. Outcomes in infantile 
cataract have improved substantially over the past few decades, largely 
owing to the application of basic neuroscientific understanding of 
sensitive and critical periods in visual neurodevelopment. Cataract- 
related childhood visual impairment is largely due to bilateral stimulus 
deprivation amblyopia: the failure to restore a normal trajectory of 
visual neurodevelopment during a brief and finite window of oppor- 
tunity. This critical window closes at some point during the first six 
months of life. Thus, younger age at surgery is the strongest predictor 
of better visual outcome and, in cataract present from birth, the window 
for intervention is conventionally considered to be the first six to eight 
weeks of life. Hence whole-population newborn screening programmes 
exist in many countries to ensure early diagnosis of congenital cataract 
and prompt referral for specialist treatment. In settings where such 
programmes do not yet exist, late diagnosis and treatment, resulting 
in irreversible amblyopia, drives poor visual outcomes. We have 
previously, on behalf of the British Isles Congenital Cataract Interest 
Group, reported outcomes within a contemporaneous, nationally 


representative cohort of children undergoing surgery in the British 
Isles for congenital and infantile cataract in the first two years of life 
(IoLunder2 study)*. These outcomes are comparable to those found in 
other contemporary reports® and are more than twofold better than 
those reported by Lin et al. in either their experimental or control 
groups. Indeed, the mean acuity achieved in their trial is the threshold 
for legal definition of blindness, an outcome that would lead most 
ophthalmologists, and probably most parents, to question the value of 
this new proposed intervention. 

Effective treatment for congenital cataract requires, alongside 
surgery, post-operative management of the loss of refractive (focusing) 
power through removal of the natural lens. Failure to appropriately 
manage this results in dense amblyopia. As the method described in Lin 
et al. involves an 8-month post-operative period of lens regeneration 
with consequent partially obscured and poorly focused vision, the 
inevitable resultant amblyopia may explain the poor visual outcomes. 
The authors offer no other explanation, and do not describe how the 
rapidly changing and highly defocused refractive state (moving through 
18 diopter units of refractive error in 8 months) was managed. Had the 
report adhered to the international reporting standard of CONSORT® 
it might be have been possible to assess the quality (internal validity) 
and generalizability (external validity) of the trial. For example, it is 
necessary to know: whether the control and intervention groups were 
equivalent with respect to baseline clinical characteristics (particu- 
larly age at surgery); how randomization was undertaken; the power 
calculation and the primary outcome and secondary outcomes on 
which this was based; how clustering by surgeon was addressed; and 
how clustering and correlation of outcome data were addressed in the 
analysis, given that both eyes of each subject were treated and analysed. 
The authors have described their study as a phase 1 trial, but the aim of 
such an investigation is to assess adverse outcomes of treatment, such 
as uncorrected high or irregular refractive outcome, which will drive 
the visual system towards amblyopia and resultant visual impairment. 

As the paper stands, it is not possible to agree with its principal 
conclusion that it provides evidence supporting the superiority of the 
novel treatment. A tempered report, clearly articulating the limitations 
of the approach with respect to outcome and permanency of effect, 
would have avoided giving the false impression that the new approach 
can be expected to supercede current treatment practices. 
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Lens regeneration in children 


ARISING FROM H. Lin et a/. Nature 531, 323-328 (2016); doi:10.1038/naturel7181 


Congenital cataracts are the primary cause of treatable childhood blind- 
ness worldwide, affecting about four infants per 10,000 live births!. 
Current surgical techniques have helped thousands of patients, but the 
limitations of these techniques?“ led Lin et al.° to report an alternative 
approach that they claim leads to a regenerated lens with refractive 
capacity. We have concerns regarding features of the presented data 
and the conclusions reached in the Article. This has implications for 
our patients who ask for such surgical intervention. There is a Reply to 
this Comment by Liu, Y. et al. Nature 556, http://dx.doi.org/10.1038/ 
nature26150 (2018). 

Embryologically, the lens of the eye is derived from the surface 
ectoderm®”’ and shares many properties of ectodermal tissue. Lens 
cortex continues to be produced throughout life, and lens size continues 
to increase owing to progressive elongation of slowly dividing lens 
epithelial cells (LECs) into fibre cells that lose their nuclei*"°. LECs 
are capable of unwanted proliferation in aged eyes; ophthalmologists 
observe this routinely after cataract extraction, when LECs proliferate 
and form semi-transparent scar-like tissue such as Soemmerring 
(S6mmering) rings and lens pearls'*. Lens transparency is main- 
tained as the result of highly organized fibre cell packing with extra- 
cellular spaces that are narrower than the wavelength of light. The 
authors claim that if they perform a cataract extraction in a way 
that reduces the injury to the lens epithelium, the normal natural 
epithelial healing response will lead to a new biconvex, transparent 
lens with flexibility (accommodative power). However, the authors 
concede in their Reply to this BCA that with their intervention it 
is unreasonable to expect a completely normal regenerated lens. 
Without a completely normal lens, including clarity, normal shape, 
and normal size, one would predict that normal visual function would 
not develop and children’s visual development would suffer from 
amblyopia. The lack of clarity and normal structure of the regen- 
erated lentoids is highlighted for example in figure 3b of Lin et al.°, 
where the image of a rabbit lens seven weeks after surgery shows an 
optically irregularly shaped lens, not the biconvex shape of a normal 
lens®. In addition, figure 3c and f of ref. 5 shows images of rabbit lenses 
with posterior subcapsular cataracts in the visual axis that would 
decrease vision. Furthermore, histopathology was not presented in the 
rabbit regenerated lenses at maximum axial length, and thus extended 
data figure 7a of Lin et al.° is insufficient to assess the quality and nor- 
mality of lens regeneration. To demonstrate a normal reconstituted lens, 
it is necessary to provide sagittal lens sections and lens measurements 
that encompass the nodal point of the eye. The lens imperfections 
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demonstrated in figure 4a of ref. 5 are multiple, not single including 
the visual axis, and represent more than just the loss of LECs and scar- 
ring at the site of capsulorhexis. Such changes are by definition cata- 
ractous. The images of postoperative human eyes in figure 5f of ref. 5 
appear to show hazy (that is, cataractous) lenses. More importantly, 
extended data figure 8e of ref. 5 shows no regenerated lens six months 
after lens surgery. Instead, a single light reflex is shown, which is likely 
to be from the lens capsule alone with no lens cortex and no evidence 
of a clear, biconvex crystalline lens that would be required for normal 
vision (compare the single lens capsule slit lens reflex of extended data 
figure 8e of ref. 5 with the normal lens reflex in figure 3b). 

In the clinical setting, a clear view by indirect ophthalmoscopy (as 
shown in extended data figure 8c, d of ref. 5) in no way signifies that 
the lens casts a clear and focused image on the retina. It is easy for 
indirect ophthalmoscopy to get clear views through lenses with legally 
blinding cataracts. There is no information about the authors’ manage- 
ment of the children’s refractive status. Importantly, there remains the 
distinct possibility that the proposed surgical technique may cause infe- 
rior visual outcomes. Comparing visual acuity measures in studies of 
children can be inherently difficult because of different clinical charac- 
teristics and the use of different measurement techniques. Nevertheless, 
the visual outcomes reported in both the experimental and convention- 
ally treated groups were unacceptably poor. Six months after surgery, 
the mean visual acuity was equivalent to 20/200 (6/60), whereas other 
investigators have reported that 60% of children with bilateral infantile 
cataracts achieve an acuity of 20/40 or better after standard ophthalmic 
care, and almost all achieve better acuity than the mean visual acuity 
reported by Lin et al.°. We would welcome a graphic, detailed distribu- 
tion of the visual acuity results from the authors’ patients (rather than 
only means or summarized data). Follow-up data covering more than 
one year represents another important clinical parameter. Finally, as the 
authors state that they did not exclude all genetic causes of congenital 
cataracts, and that they did not perform any gene editing or mani- 
pulation of the lens epithelial cells, we question why the proliferating 
LECs, presumably with genetic variants causing the original congenital 
cataracts, did not recapitulate the original lens opacity. One cannot just 
assume that germline mutations would constitute a minority of their 
cases without having data from their patients to support such a claim. 

Although the current surgical approaches for congenital cataracts 
have adverse events and several limitations that have been identified in 
controlled long-term studies, Lin et al.° do not provide sufficient short- 
term or long-term supportive evidence to support their statement that 
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their approach may afford an infant eye a good chance of an improved 
visual outcome. The study is limited by varied follow-up, poor visual 
acuities, insufficient details regarding adverse events, and a lack of infor- 
mation about the need for additional surgeries. These limitations should 
alert us to the need to proceed cautiously. Using such an unproven 
experimental therapy in both eyes of a patient is not appropriate. 

We all support innovation and disruptive technology. Yet, at the 
same time, it is necessary to pay careful attention to details including 
thorough documentation and achievement of long-term milestones 
to support the conclusions. One should be even more vigilant when 
proposing a new therapy for our youngest patients, who represent the 
most vulnerable population. It is important to report both study results 
and limitations in a clear and balanced fashion, particularly to parents 
who desperately want the best possible outcome for their infants. 
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Replying to: D. G. Vavvas et al. Nature 556, http://dx.doi.org/10.1038/nature26149 (2018); A. L. Solebo, C. J. Hammond & J. S. Rahi Nature 


556, http://dx.doi.org/10.1038/nature26148 (2018) 


In the accompanying Comments'”, Vavvas et al. describe issues 
regarding the normality of lens regeneration and Solebo et al. describe 
improving outcomes of congenital cataract. We welcome these 
Comments on our paper. 


In our Article’, we did not claim that a completely normal lens was 


regenerated. Instead, our hypothesis was that current surgical methods 
for paediatric cataract, namely anterior continuous curvilinear capsu- 


lorhexis, may impair the integrity of lens epithelial stem cells (LECs). 
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Vavvas et al.' highlighted several figures as ‘cataractous. While these 
lenses were mostly clear (particularly in the visual axis), we did not 
claim that they were completely normal and explicitly acknowledged 
their imperfections, which mainly reflected loss of LECs, mild periph- 
eral scarring at the capsulorhexis site, and anterior—posterior capsule 
adhesions that resolve over time. The histopathology sections inter- 
preted by Vavvas et al. as showing small or irregular lenses! were 
intentionally cut offset from the axial centre of the lens to minimize 
disruption to the lens cortex. Furthermore, dissected lenses often 
shrink upon alcohol dehydration. Thus, irregularities in size or shape 
were essentially fixation artefacts. Vavvas et al.' also comment on the 
lack of quality office photographs, although infants usually do not 
cooperate with quality office photography, and this was also not done 
in the recent NIH Infant Aphakia Treatment Study (IATS)*. 

Regarding concerns about amblyopia, all patients in our study under- 
went monthly post-operative correction of refractive error with either 
spectacles or contact lenses to maintain an appropriate refractive state 
consistent with age. 

Regarding visual outcomes, the studies*® cited by Vawvas et al.' are 
not comparable to ours for several reasons. First, the mean age was 
higher (5.3 years and 10.2 years compared with under 2 years in our 
study), which may have confounded visual outcomes, as the IATS noted 
that follow-up length can affect visual acuity comparisons‘. Second, 
the evaluation of visual acuity differed (Teller Acuity Cards for visual 
resolution? versus Snellen Visual Acuity for visual recognition®®). 
Although Teller cards can be roughly translatable to Snellen equiva- 
lents, accuracy and false comparison concerns usually preclude this. To 
illustrate, the IATS reported logMAR grating acuities at 1 year of age 
and not Snellen equivalents‘. Third, only 60% of children had 20/40 or 
better in ref. 5, and only 70% were able to complete Snellen visual acuity 
testing in ref. 6. The values reported in those studies therefore likely 
reflect bias of the data towards a seemingly better outcome. 

Similarly, Solebo et al.’ also refer to a group of children operated on 
via primary intraocular lens implantation at a median of 8 months of 
age’. The description provided in ref. 7 leads us to believe that these 
children may have a different form of cataract (developmental) than the 
one exhibited by our group of patients (congenital), making direct com- 
parisons of outcome difficult and inappropriate owing to the variation 
in impact on amblyopia. 

In commenting on our outcomes, Solebo et al.” did not account 
for normal age-related changes in visual acuity. During development, 
20/200 is close to normal vision for six months of age, and 20/50- 
20/80 is close to normal vision for one year of age, depending upon 
the method of testing used. The visual acuity levels in our population 
were appropriate and not inferior to their results if age is appropri- 
ately considered. In addition, an error in vision reported in our initial 
paper was corrected in a subsequent Corrigendum®. Therefore, their 
comments on our visual acuity results are now out of context and 
inaccurate. 

Both Vavvas et al.' and Solebo et al.” mention concern of recurrent 
cataracts in children with underlying germline mutations. We agree 
that in these patients, cataracts would be expected to recur eventually. 
However, we excluded infants with a family history of ocular disease 
and were not aware of any inherited mutations among the children 
in our study. Note that lens removal and placement of an intraocular 
lens implant (IOL) also does not address any underlying genetic 
mutations. All the regenerated lenses in our study were all initially 
transparent. Only two lenses became cloudy again after one year and 
required further surgery. However, those patients had a relatively clear 
lens during a critical period in visual development, thus avoiding high 
risk of amblyopia. 
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New techniques in medicine deserve careful consideration, as 
well as established reproducibility, before adoption. The intent of 
this minimally invasive surgical method for paediatric cataracts was 
precisely to “First do no harm;” that is, to allow regeneration of a func- 
tional lens with greater transparency of the visual axis, while avoiding 
the side effects and complications associated with current methods. 
The IATS reported an 81% incidence of adverse events when IOLs 
were implanted in babies, and 72% of their IOL group required addi- 
tional surgery under general anaesthesia versus 21% of the aphakic 
group’. Our new approach may afford the infant eye a better chance for 
improved ocular development and visual outcome, without the need for 
a more invasive procedure with increased risk of ocular and possibly 
neural complications. We are currently conducting a longer-term follow 
up study to gain additional data about long-term safety and efficacy. 
Generation of a completely normal and functional lens is our ultimate 
goal; this will require the creation of a conducive environment and 
scaffold for LES proliferation and differentiation. 

The author list of this Reply comprises those authors involved in 
clinical investigation of lens regeneration. Those authors of the original 
paper who were involved in animal and cell culture experiments and 
did not contribute to this response are not listed here. 
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Investigating non-Joulian magnetostriction 


ARISING FROM H. D. Chopra & M. Wuttig Nature 521, 340-343 (2015); doi:10.1038/nature14459 


Ferromagnetic materials change their shape under an applied magnetic 
field—a phenomenon known as magnetostriction. This phenomenon 
was first described for iron by Joule in 1842, and is generally believed 
to be volume-conserving. We therefore read with interest the Letter 
by Chopra and Wuttig', which reports that samples of Fejo9_,-Gax 
(galfenol) crystals demonstrate “giant” non-volume-conserving (non- 
Joulian) magnetostriction, and embarked on an extensive study of crys- 
tals of similar compositions, dimensions and heat treatments, using 
strain gauges and capacitive dilatometry to measure magnetostriction 
for many different combinations of strain direction and applied mag- 
netic field. In every case, we found that the volume was conserved 
within experimental error, and so we conclude that magnetostriction 
in galfenol can generally be regarded as Joulian. 

The initial claim! was based on strain-gauge measurements 
in the plane of slow-cooled or quenched, disk-shaped crystals of 
Feg) 9Gaj7.; and Fe73,9Gaz¢,1. These samples were 5 mm in diameter and 
0.4-0.5mm thick. A magnetic field of up to 3,000 Oe was applied 
in-plane. The crystals were found to expand in all, or almost all, of the 
directions that were tested. No measurement of strain was reported 
for the [001] direction perpendicular to the plane of the disks because 
it was assumed that “a negligible magnetization normal to the plane 
of the disk at comparable fields implies that no volume change occurs 
along [001].”! Strain-gauge data for the [001] direction of another 
slow-cooled Feg2 9Gaj7.; crystal were published subsequently as an 
Addendum’, in support of the original assertion. 

In our attempt to verify the idea that the enhanced magnetostriction 
of galfenol is largely non-Joulian, we grew 12 different rod-, disk- or 
cuboid-shaped crystals with compositions of Feg3;Gaj7 or Fe74Gaz¢, 
heights (or thicknesses) ranging from 70 mm to 0.1 mm and diame- 
ters (or widths) ranging from 16 mm to 7 mm. Here we focus on the 
four of these crystals that were subjected to the same heat treatments 
as described in ref. 1 (annealed at 1,033 K for 30 min and then either 
quenched or slow-cooled at 10K min~'). We measured the magne- 
tostriction (A, in parts per million (p.p.m.)) using strain gauges or, 
for the disk-shaped crystals with thicknesses of 0.5mm, by capacitive 
dilatometry? in the [001] direction. 

The most complete datasets were obtained for a single crystal of 
Feg3Ga,7 with dimensions of 10.6 mm x 10.6 mm x 2.4mm in the 
quenched and slow-cooled states. For this crystal we measured the satu- 
ration magnetostriction along the [100], [010], [001], [110] and [110] 
directions in a magnetic field large enough to saturate the magnetiza- 
tion applied along any one of these directions, yielding 17 independent 
measurements for the crystal in each state. Defining the magnetostric- 
tion components (in p.p.m.) for a given applied field (i) and measure- 
ment (j) direction as \9 and 5° for the quenched and slow-cooled 
states, respectively, and using brackets ‘[...]’ to denote an array of these 
components, we find 
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for i, j € {[110], [110], [001]}. Fitting the measured data to minimize 
the discrepancy between the measured values and calculated 
values (see below) yields the numbers given in parentheses in 
equations (1) and (2). 

The saturation magnetostriction j; is the linear strain Alj/ljin an 
applied field that is large enough to saturate the magnetization. We 
compare our data with the results expected from the standard expres- 
sion for Joulian magnetostriction of a crystal with cubic symmetry’: 


1 


3 
A= 5 00 a3? t ay t 2B? G3) 
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where Ajo9 and \;;; are the two intrinsic saturation magnetostriction 
constants of the cubic material, and a and ( are the cosines that 
define the magnetization direction (i) and the measurement direction 
(j), respectively; for example, when the field is applied along i = [100] 
(the x direction) and the magnetostriction is measured along j = [010] 
(the y direction), a,=1, ay=a,=0, By=1 and 6,= 8,=0. More 
general expressions for other lattice symmetries have been derived 
previously°. 

We make no assumptions about the domain structure in the initial 
demagnetized state. Defining n,, n,andn, as the fractions of domains 
oriented along the three principal directions, [100], [010] and [001], 
respectively, and assuming that the strains are additive, the predicted 
magnetostriction based on equation (3) is 


3 Ny+Nz, —Ny —n,z 
[Ai]= = joo Tnx Ny +Nz Nz (4) 
2 =i, —nNy Nyt Ny 


for i, j € {[100], [010], [001]} and 


3 Nn, nN, —2n, 
[Aj = ra nz Nz —2n, 
—(nytny) —(ny+ny) 2(nyt+ny) 5 
i) 
P nytny —(nx+ny) 0 
+7 Al (ne + ny) nytn, 0 
0 0 0 


for i, j € {[110], [110], [001]}. As in equations (1) and (2), the rows of 
[Aj] in equations (4) and (5) correspond to the applied field direction 
(i) and the columns to the measurement directions (f). 
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Consider the first element in the array in equation (4) as an example, 
i=j—=[100] = 1. For an initially +x-aligned domain, a,=+1 and 
Qy = az = 0. Because we are measuring along the x direction, 3,=1 and 
G,=6,=0. Putting these numbers into equation (3), we find, A}, = Ar00- 
But there is no difference in strain between the initial and final states 
when we flip the a, = —1 domains; it remains A}, = Ajo9. The shape of 
the sample is unchanged by the magnetization process of the x-aligned 
domains, so they do not contribute to this element of the magnetostric- 
tion. However, for an initially y-aligned domain, ML =—A100 / 2, because 
x= Az=0, y= +1, 3,=1 and 3,=3,=0 when we measure along x. 
A field applied along x rotates the magnetization of the initially 
y-aligned domains to a, =1 and ay=a,=0, yielding A}, = Axo0- The 
net magnetostriction contributed by y-aligned domains is therefore 
Aioo — (—At00/2) = 3A100/2. The same is true for an initially z-aligned 
domain: A}, = 3A;90/2. Consequently, A} ,= 0 x nx + 3A100/2 * My + 
3Atoo/2 X Nz=3A100(Ny + Nz)/2, as shown in equation (4). Similar argu- 
ments can be used to derive the other elements in equations (4) and (5). 

The Joulian condition states that the sum of the elements of each row 
of [Aji] is zero; this condition is fulfilled in equations (4) and (5). 
Furthermore, Aj99 can be deduced from the elements in any column of 
either equation independently of the initial domain structure because 
ny + Ny + nz=1. There are therefore three unknown parameters (n,, 
ny and A,99) for the [100], [010] and [001] directions and four (n,, nj, 
Azoo and Aj11) for the [110], [110] and [001] directions; but there are 
many more data in equations (1) and (2), in which the best fits to the 
data are given in parentheses. The magnetostriction is well fitted by the 
Joulian model with fit parameters ny = 35%, ny = 34% (implying 
Nz= 31%), 3Aj00/2 =236 + 6 p.p.m. and 3A,;;=25 +3 p.p.m. for the 
quenched state of FegsGaj7, and n,=49%, ny =49% (implying n,=2%), 
3X100/2 = 225 +4p.p.m. and 3A);;=23 +8 p.p.m. for the slow-cooled 


Table 1 | Domain distributions and magnetostriction in disk-shaped 
Fez00_xGa, crystals 


Row sum, 3 [A#S] 9x (%) My (%) M2 (%)- 3Aro0/2. 


(p.p.m.) (p.p.m.) 

i=[100] i=[010] 
x=17 (quenched) —5 -10 85 15 O 265+7 
x=17 (slow-cooled) 13 5 40 40 20 25147 
x=26 (quenched) -3 0) 44 45 11 17443 
x=26 (slow-cooled) 10 10 38 34 28 =160+4 


state. The small discrepancies between the measured and fitted 
values—of 2 p.p.m. and 4p.p.m. for the quenched and slow-cooled 
states, respectively, when averaging the magnitudes of the differences 
over the 17 independent measurements ((1/17) < Age — agent) — 
could be related to the width of the domains relative to the width and 
position of the strain gauges or to the uniformity of the initial state. The 
average magnitude of the row sum is 3 p.p.m. for the quenched crystal 
and 7 p.p.m. for the slow-cooled one. 

We also have nearly complete (100) data for quenched and slow- 
cooled, disk-shaped (diameter, 5mm; thickness, 0.5mm) crystals 
of Feg3Gay7 and Fe74Gaz¢. Only measurements of 33 (i=j =[001] = 3) 
are missing, because the thin disks tend to twist in the large perpen- 
dicular field that is needed for saturation. For this crystal we obtained 
eight independent measurements, for three fit parameters. These data 
are also in excellent agreement with the Joulian model in equation (3), 
with average discrepancies of 2-3 p.p.m. between the measured and 
calculated magnitudes of 5. The results of the fit and the row sums 
( yj [ART for fields applied along the i= [100] and i= [010] directions 
are summarized in Table 1. Separate determinations of 199 obtained 
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Figure 1 | Magnetostriction and magnetization of Feg3Gaj7 single 
crystals. a, b, Magnetorestriction \ for quenched (a) and slow-cooled (b) 
disk-shaped crystals (diameter, 5 mm; thickness, 0.5 mm) in an in-plane 
magnetic field applied along the i= [010] direction, measured in three 
orthogonal directions: j = [100] (black squares), j= [010] (red circles) and 
j=[001] (blue triangles). c, Magnetorestriction measured similarly for an 
as-grown 10mm x 10mm x 10mm cubic crystal. d, Magnetization curves 
for an as-grown disk-shaped crystal (diameter, 5 mm; thickness, 0.88 mm) 
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measured with the field along the [010] direction. The saturation field in 
a-d depends on the shape of the sample. e, Magnetostriction calculated 
from equation (3), with 3Aj00/2 = 250 p.p.m. and 3A); = 26 p.p.m., when 
the measurement direction is parallel to the magnetization direction in 
saturation, for an isotropic domain distribution (top; n.= ny =n, = 1/3) 
and an in-plane domain distribution (bottom; n, =n, = 1/2, n,=0). The 
radial coordinate represents the magnetostriction (in p.p.m.), while the 
angular coordinates indicate the direction of the saturation magnetization. 
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by summing the elements in the first or second columns of the array 
in equation (4), which are independent of the domain structure, agree 
with the values in Table 1 to within 5 p.p.m. 

Our results suggest that the heat treatment modifies the initial 
domain structure of the crystals, which have the same dimensions as 
those studied in ref. 1, but that it has only a small effect on the magne- 
tostriction for a given composition. Data for Feg3Gaj7 in the quenched 
and slow-cooled states are shown in Fig. la, b; data for a cubic crystal 
with an isotropic initial domain distribution are shown in Fig. 1c for 
comparison. Our crystals exhibit zigzag-striped domains, with 90° or 
180° domain walls, as described previously® *. We found no sign of the 
small periodic Landau closure domains that are evident in images of 
the surface of a Fe739Gay¢, crystal in figure 3 of ref. 1. 

The magnetization curves for our disk-shaped samples (Fig. 1d) 
resemble those reported in ref. 1, but the negligible magnetization normal 
to a disk magnetized in an in-plane field is accompanied by substantial 
perpendicular magnetostriction whenever z domains are present, rep- 
resented by the —(3/2)n-Aj09 terms in equation (4). Altogether, we have 
obtained 31 row sums from measurements of 12 different crystals, with 
an overall average of 6 +7 p.p.m. None of these row sums is representative 
of the sample in ref. 2, for which the average row sum is 134 p.p.m. 

Finally, we repeated Joule’ original 1846 experiment’, measuring the 
saturation volume magnetostriction of a rod-shaped Feg3Gaj7 crystal 
(diameter, 16mm; height, 70 mm) using the capillary liquid displace- 
ment method in alcohol in a field of 2 kOe. This method avoids the 
shortcomings of strain gauges. The crystal was measured as-grown and 
after quenching or slow-cooling. In all cases, the volume expansion was 
found to be less than 5 p.p.m. 

In summary, we have found no evidence of giant non-Joulian mag- 
netostriction in any of the galfenol crystals that we studied. We do not 
exclude the possibility of minor non-Joulian contributions associated 
with, for example, forced volume magnetostriction, auxetic behaviour 
or the ‘AE effect’ in the unsaturated state. However, we conclude that 
the large intrinsic magnetostriction of galfenol is essentially Joulian and 
that the findings of ref. 1 are not generalizable to this class of iron-based 
magnetostrictive materials. 


Data Availability The data generated and analysed during this study are available 
from the corresponding author on reasonable request. 
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ILLUSTRATION BY THE PROJECT TWINS. 


ECOLOGY S REMOTE-SENSING 
REVOLUTION 


Satellite data, and the tools that ecologists use to analyse them, 
are more accessible and plentiful than ever. 


ao 


BY ROBERTA KWOK 


hen ecologist Nicholas Murray 
started digging into remote-sensing 
data for his PhD project, he had no 


idea how hard his task would be. Murray wanted 
to know why shorebirds that migrate through 
Asia were declining in number. Because the 
birds stopped in places that were difficult for 
Murray to access, such as North Korea and 
China, he turned to satellite data to evaluate 
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their habitat. 

When Murray started the project in 2010 
he guessed it would take a few months, but it 
ended up taking about a year. Murray first had 
to download metadata for about 5,500 publicly 
available US government satellite images to 
identify those of tidal wetlands taken during 
low tide along the Yellow Sea, which borders 
China and the Korean peninsula. He then 
wrote custom software code to classify land 
cover ina final set of 80 images. An algorithm to 


w\ 


distinguish water from land already existed, but 
he needed to make manual adjustments for each 
image. More than one-quarter of the wetland 
area had vanished between the 1980s and 2000s, 
Murray discovered. But the analysis wasn't 
easy. Throughout that whole process, I was 
thinking, “This is so difficult, it's unbelievable,’ 
recalls Murray, now at the University of New 
South Wales in Kensington, Australia. 

Today, Murray's task would be much simpler. 
Numerous tools have been developed to 
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» access and analyse remote-sensing data, 
allowing ecologists to tackle large-scale conser- 
vation problems more easily. Government agen- 
cies, open-source developers and commercial 
firms are offering everything from point-and- 
click interfaces to command-line-driven soft- 
ware. “I think we're at the best time possible to be 
doing it; Murray says of analysing satellite data 
for ecology research. “It's getting so accessible.” 

‘Remote sensing’ encompasses a suite of 
techniques for observing something without 
touching it. The term usually refers to collect- 
ing data about Earth from space or from air- 
borne platforms by measuring energy reflected 
or emitted at various wavelengths. Researchers 
can use these data to infer, for example, the level 
of deforestation. “We've seen a real explosion in 
the use of satellite data,’ says Allison Leidner, 
a contract senior support scientist at NASA’s 
Biological Diversity research programme in 
Washington DC. 

Landsat data, gathered by NASA and the US 
Geological Survey (USGS), extend back to the 
1970s and enable the study of planetary change 
over many decades. NASA’s Moderate Reso- 
lution Imaging Spectroradiometer (MODIS) 
instruments, launched in 1999 and 2002, 
measure reflected solar radiation and emitted 
radiation, and the data are automatically con- 
verted into ecologist-friendly parameters such 
as vegetation greenness. And Europe's Sentinel 
satellites, which monitor the land, ocean and 
atmosphere, have been providing data since 
2014. 

Users can browse free government data sets 
at online portals such as NASA’s Earthdata 
Search, EarthExplorer from the USGS and the 
European Space Agency’s Copernicus Open 
Access Hub. Earth data are typically divided into 
sections called ‘scenes or ‘tiles’ — snapshots of 
energy of varying wavelengths reflected from 
that area. But to obtain higher spatial and 
temporal resolution, researchers might want to 
consider commercial options. 

The Dove satellites operated by Planet 
in San Francisco, California, for example, 
gather global data at a resolution of 
3.7 metres — sharp enough to distinguish 
individual large trees — about once a day. 
The Sentinel-2 satellites, by contrast, which 
are among the highest-resolution govern- 
ment satellites with free and open data, have 
10-metre pixels and sample each spot every 
5 days. University researchers can apply for 
free access to 10,000 square kilometres of 
Planet’s satellite data per month through the 
firm’s Education and Research programme. 
Similarly, academic environmental researchers 
can apply for free access to data from sub-metre- 
resolution satellites operated by DigitalGlobe in 
Westminster, Colorado, through the non-profit 
organization DigitalGlobe Foundation. 


USER-FRIENDLY ACCESS 
Data sets can be unwieldy, however. Kyla Dahlin, 
an ecogeographer at Michigan State University 
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in East Lansing, notes that 30 years of data 
collected for one Landsat scene could exceed 
1.5 terabytes “for an area that’s smaller than 
Michigam. Visualization software for remote- 
sensing data might not function well with 
certain file formats, and although these files can 
be converted into an easier-to-use format, that 
step adds another obstacle, says Cindy Schmidt, 
associate programme manager of Ecological 
Forecasting Applications at the NASA Ames 
Research Center in Moffett Field, California. 
Inexperienced users “just want to throw in the 
towel sometimes’, she says. “They don't have 
time to deal with that kind of stuff” 

Free and commercial resources are available, 
however. In 2017, Murray's team released a free 
online tool called Remap, which enables users 
to generate maps from remote-sensing data. 
Users train the software to classify land-cover 
types, such as forest or wetlands, by upload- 
ing geo-referenced data or identifying pixels 
on the basis of fieldwork or their knowledge. 
Remap then uses machine learning to classify 
the remaining pixels. As of March 2018, about 
4,300 people from more than 100 countries have 
used Remap, Murray says. Another online tool, 
called Global Forest Watch, creates maps of 
deforestation patterns. 

Dahlin recommends the online tool 
AppEEARS (Application for Extracting and 
Exploring Analysis-Ready Samples), which 
allows users to grab data specific to their 
study site, instead of an entire tile or scene. 
“Imagine the archive is this big lake of data,” 
explains Tom Maiersperger, project scientist 
at the NASA Land Processes Distributed 
Active Archive Center (DAAC) in Sioux Falls, 
South Dakota, which led the tool's development. 
“We're allowing people to come in with a syringe 
and suck up that little sample that they want.” 
Users can provide geographical coordinates, 
a time span and variables of interest — such as 
tree cover — and the software returns the data 
as a comma-separated-values (CSV) file. 

Similarly, the US Oak Ridge National 
Laboratory DAAC, in Tennessee, has tools 
to provide for example a time series of green- 
ness for a study site as a spreadsheet and graph, 
or processed data, such as inferred forest dis- 
turbance. Ecologists can then analyse links 
between vegetation and other variables such 
as animal populations. One team, for example, 
studied the Andaman Islands off the Indian 
coast and found that vegetation degraded more 
quickly in areas where elephants and spotted 
deer had been introduced. 

For ecologists who want to write their own 
analysis software, but avoid the hassle of down- 
loading satellite data, Google Earth Engine is 
a popular choice. Google has already down- 
loaded satellite data sets onto its servers, and 
researchers can access them in the cloud for 
free through Google's JavaScript and Python 
programming interfaces. This service allows 
researchers to perform large-scale analyses much 
faster than they could on their local computers. 


Murray, for instance, leveraged that process- 
ing power to map global intertidal zones over 
time. Because it used more than 700,000 satellite 
images, the analysis would have taken years 
on a single computer — but it took less than 
a week on Google Earth Engine. The tool has 
“revolutionized the sorts of remote sensing 
questions I can ask’, Murray says. 

Google says that users need not worry that 
it will claim ownership over their intellectual 
property (IP), such as code and scientific 
results. “Our terms of service make it clear that 
your IP is your IP and we make no claims on 
it,’ says Noel Gorelick, an engineer at Google 
in Zirich, Switzerland, who co-developed 
Google Earth Engine. Still, Martin Wegmann, 
a remote-sensing researcher at the University of 
Wiirzburg in Germany, prefers to download sat- 
ellite data and run his code locally. Because his 
analyses are relatively small in scale or coarse in 
resolution, performance is not an issue, he says. 

Other cloud-computing options include 
the Centre for Environmental Data Analysis, 
run by the Science and Technology Facilities 
Council in Harwell, UK; Copernicus Data and 
Information Access Services, funded by the 
European Commission and scheduled to go 
live in June; and DigitalGlobe’s GBDX platform. 


OPEN-SOURCE OPTIONS 

Whichever platform they choose, research- 
ers typically write custom code to drive data 
analysis, often in the programming language R. 
Wegmann and his colleagues are developing 
an R package called getSpatialData, which will 
allow users to download satellite data without 
using a browser interface. His team also devel- 
oped the RStoolbox package, which includes 
different algorithms for computing vegetation 
measures so that users do not have to calculate 
specific formulae individually. 

Researchers can also use commercial desktop 
analysis and visualization packages such as 
ENVI from Harris in Melbourne, Florida; 
ERDAS IMAGINE from Hexagon Geospatial 
in Madison, Alabama; ArcGIS from Esri in 
Redlands, California, as well as free, open- 
source alternatives such as QGIS. 

Because using these tools can involve steep 
learning curves, Anita Graser, a geographic 
information scientist at the Austrian Institute 
of Technology in Vienna and a member of 
the QGIS project steering committee, advises 
beginners to take online classes. NASAs Applied 
Remote Sensing Training programme offers 
webinars, and the agency gives workshops at 
ecology and conservation conferences. 

The possibilities are enticing, but researchers 
must remember to stay grounded. “If you 
wanted to see how often a butterfly visits 
anectar plant, you're not going to pick that up 
on a satellite,’ Leidner says. But for larger-scale 
problems, “it’s an incredibly powerful tool”. m 


Roberta Kwok is a freelance writer in 
Kirkland, Washington. 
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The path to purpose 


Early-career researchers should persevere to find meaning in their work. 


BY JACK LEEMING 


hen Melissa Craig realized that 
biological matter such as algae 
might slow the speed of powerful 


underwater landslides by releasing chemicals 
that help to glue sea-floor mounds together, 
she failed to share her colleagues’ enthusiasm 
around the discovery. The group, based 
at Bangor University, UK, built miniature 
versions of those landslides in water tanks, 
changing the composition and amount of 
material that made up the mounds to model 
the sea floor at its most violent. 

Craig’s finding was unprecedented. “There 
had been nothing before her experiments that 


found the same results,’ says ocean scientist 
Jaco Baas, Craig’s supervisor at Bangor. But 
Craig, then a PhD student who was visiting 
from the University of Adelaide in Australia, 
couldn't see how her algal discovery would be 
of interest to anyone beyond her immediate 
academic circle. “I struggled to appreciate the 
impact of what I was doing,” she says. So, in 
late February this year, Craig started work- 
ing as a geologist with Oil Search, an energy 
company based in Papua New Guinea. She’s 
hoping to conduct research that has clear 
practical applications. 

Long before they embark on PhD 
programmes, potential scientists are told 
by teachers and the media that their work 


will have a lasting impact. Whether it’s by 
helping to cure diseases, build clean-energy 
infrastructure, or even provide food or 
water for communities affected by famine or 
drought, many early-career researchers hope 
that they can somehow make a difference. “We 
grow up thinking that we're going to solve the 
world’s problems,’ says Florie Mar, a scientific- 
communications director at Genentech in 
South San Francisco, California. She earned 
her PhD in cancer biology in 2015 from the 
University of California, San Francisco. 
Although researchers are often motivated 
by a sense of curiosity and the drive to explore 
how the world works, some also see science as 
a way of making the world a better place. | 
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> It's widely acknowledged that the scientific 
endeavour works as an accumulation of small 
discoveries. “Your knowledge — however 
unpredictable and however useless it may 
appear — will be valuable,’ says Philip Blower, 
a cancer-imaging chemist at King’s College 
London. But Mar and Craig say that they had 
hoped to obtain more tangible results during 
the course of their research. “I wanted some- 
thing that I could measure,’ explains Mar. 
Craig reaches for a similar thought: “I wanted 
something visual — like seeing someone walk 
around with something that you know you 
made or helped with” 

Both now feel that they’re using their 
scientific skills to do meaningful work out- 
side academia. Mar, for example, instructs 
doctors and pharmacists on the clinical uses 
of Genentech’s drugs — information that she 
anticipates will go on to help patients. 

The need to find deeper meaning in their 
work plagues many scientists, who can feel 
stuck in an often cut-throat system that's more 
concerned with bibliometrics than transfer- 
ability. Many also say that they can’t see how 
their research is contributing to society in a 
meaningful way. A 2016 report by professional- 
networking service LinkedIn (see go.nature. 
com/2i2srtc) notes that 41% of research 
professionals — compared with 37% across all 
sectors — say that they are mainly driven by 
purpose rather than by money or status. 

Employers should sit up and take notice: 
the same report found that purpose-driven 
employees had greater job satisfaction and 
were more likely to stay at their company 
for at least three years. To help maintain an 
interest in their work, researchers can try a 
variety of ways to stay motivated. Some seek 
out laboratories that are engaged in highly 
translational research or that collaborate with 
partners in industry. Others look for research 
posts in industry. And more commercially 
focused scientists might even launch start-up 
companies to find the impact and value that 
they require. 


IMPACT THROUGH INDUSTRY 

For many early-career researchers, the sheer 
size of the scientific endeavour, as well as 
an increasing pressure to win grants and to 
publish results, can be discouraging. “When 
I talk to postdocs and PhD students, they 
often feel like cogs in a massive machine,” 
says Jason Blackstock, a lecturer in public 
policy and engineering at University College 
London, who trained as a physicist. “There's 
tremendous pressure to publish in whatever 
the direction the UK research councils are 
funding in” 

Dolores Del Prete, a postdoctoral researcher 
who studies the role of certain cells in the 
brain in psychiatric disorders at BioMed X, a 
contract research organization in Heidelberg, 
Germany, agrees that the system can feel over- 
whelming. During her first stint as a postdoc, 
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The creation of Almac Diagnostics in Craigavon, UK, required funding from outside academia. 


in which she investigated a protein linked to 
Alzheimer’s disease at Albert Einstein College 
of Medicine in New York, Del Prete decided 
that she needed to move into an environment 
that conducted more-applied research. “I was 
doing really basic research,” she says. “I liked 
it, but I had this pressure that was all about 
publishing — to get more papers, to get more 
grants, to keep going. It was frustrating.” She 
thinks that the publish-or-perish hamster 
wheel in which she became stuck will be a con- 
cept familiar to many researchers in academia. 
“Tt was research led by papers and grants, not 
papers and grants led by science,’ she says. 

At BioMed X, Del Prete can see the results of 
her work more clearly. “In the future, industry 
can develop our drug,” she explains. 

For some, the ability to stave off frustration 
and to find meaning in their work can come 
from developing a side project. When Mar 
noticed her PhD research turning from an 
enjoyable endeavour into a three-year chore, 
she started to make videos that combine voice- 
overs and whiteboard illustrations to explain 
complex topics such as genetic variation, 
neuroscience and diabetes to the public. 
So far, she’s posted more than 60 videos on 
YouTube as a way to engage with her audi- 
ence, track interest, teach science — and find 
real-life, immediate impact. She has since 
launched Youreka Science, an independent 
science-communication company. 

Mar says that the skills she acquired during 
her PhD, including critical thinking and com- 
municating scientific ideas to the public, have 
supported her new career direction. “It’s a way 
to utilize your scientific training,” she says. 


TRANSLATIONAL SCIENCE 

Blackstock says that early-career scientists 
who want their research to have wider societal 
impact should seek to work and study at 


institutions that nurture this drive through 
programmes that focus on transferability. 
“If you really want to change the world, just 
learning the technical stuff still leaves you try- 
ing to figure out how anything you've learned 
matters,” he says. Instead, “Find programmes 
that have really strong practical levels of 
engagement. Or, at the very least, programmes 
whose projects have real-world partners” that 
can teach students to apply their findings 
in a practical way outside academia. As an 
example, Blackstock cites University College 
London's ‘How to Change the World’ training 
programme, which he directs, and which pairs 
engineering students with representatives from 
industry and government to examine issues 
such as water quality and public transport. 
Partners have included the UK Department 
of Transport and London-based engineering 
company Atkins. 

He also suggests that students should find 
PhD programmes that collaborate directly 
with industry. Germany’s Fraunhofer Society, 
for example, receives 70% of its research 
funding from contracts with external part- 
ners and operates institutes that focus on 
topics such as lasers and wood technology. 
Developing a device or procedure that people 
need is an integral component of the research 
programme rather than a bonus. 

After Atma Ivancevic completed her PhD in 
bioinformatics at the University of Adelaide, 
she realized that she would need to take care 
in selecting a lab for her postdoctoral work. 
Ivancevic had studied mobile elements in 
DNA — sequences that move around the 
genome across generations to drive evolu- 
tion — and found it enjoyable, but says that 
its transferability was not obvious to her at the 
time. “It was hard to see how it could be appli- 
cable to something these days,” she says. “The 
sorts of effects we're talking about take millions 
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of years to come to fruition” 

She knew that she had found the right 
lab when, during an interview, a potential 
supervisor tilted his computer screen towards 
her. “He showed me four or five e-mails hed 
got that day from families — not from other 
scientists,” says Ivancevic. One asked about 
his research on the genetics of severe epi- 
lepsy in women and whether there had been 
any recent advances. “It didn’t matter if he 
published a paper that year or not,’ she says. 
“He still would have answered those e-mails. 
That's real-life impact right there.” 

Ivancevic thinks that luck, as well as 
design, might play a part in determining the 
impact of scientists’ work. “Maybe they just 
haven't found out how it is applicable yet” 
Blower also believes in scientific serendipity, 
and therefore advocates for research that 
doesn’t always set out to solve a specific prob- 
lem or address a specific issue. “You turn over 
loads of stones and, with most of them, there's 
nothing underneath, but occasionally there’s 
something. If you dort turn over the stone,” 
he says, “you dont find the thing.” 

One such stone revealed the gene-editing 
technology CRISPR. Rachel Haurwitz did 
her PhD and worked as a postdoc in Jennifer 
Doudna’ lab at the University of California, 
Berkeley — one of 


the birthplaces of “When I talk to 
CRISPR. Haurwitz, postdocs and 
now chief execu- PhD students, 
tive of Caribou they often 
Biosciences in feellikecogs 
Berkeley, which inamassive 


aims to commer- machine.” 
cialize the technol- 
ogy, sees the rise of CRISPR as evidence 
to support the continued funding of basic 
research, alongside more translational 
work. “I think this story further cements the 
tremendous value and need for investing in 
basic research,” she says. “To pretend that 
we know exactly where to go to discover or 
invent the next big thing is incredibly naive” 
She suggests that scientists who want to see 
the impact of their work should seek out labs 
and companies that do translational research. 
“Actively find a way to put yourself there,” she 
says. “There are some labs in academia that 
are closer to that boundary, and there are lots 
of companies in industry who use life science 
and technologies to try to solve a problem.” 


REAL-WORLD MOTIVATION 
For some researchers, launching their own 
business can provide the meaning that they 
seek. In 2000, Paul Harkin, a molecular 
oncologist at Queen’s University Belfast, 
began to realize that to extend his work on 
the gene BRCA1, which is implicated in 
hereditary breast cancer, he had to move 
away from academia. 

Harkin had recognized that preserved 
samples of tumours stored at labs and 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights 


hospitals worldwide would be an invaluable 
source of data that links genetic information 
with patients’ outcomes, if clinicians had the 
tools to reliably extract partially degraded 
RNA from the tissue. But he was unable to 
launch the project from his lab at Queen's. “T 
needed to bring in substantial funding and 
additional expertise to get to commercial 
application,” explains Harkin. 

So, in 2004, he co-launched a company — 
now known as Almac Diagnostics and based 
in Craigavon in Northern Ireland, UK — to 
take his work to market. “I’ve never been dis- 
illusioned,” he says, “but I was very pragmatic 
about what could be achieved in an academic 
environment?’ 

The company’s focus has since pivoted 
to providing clinical-trials support to the 
pharmaceutical industry. And Harkin 
notes that at least one of the drugs that it 
has worked on has been marketed in the 
United States. 

Although he draws satisfaction from 
knowing that the company he built is directly 
involved in getting medicines to patients, 
Harkin highlights the positive effect that 
Almac Diagnostics has had on the scientific- 
employment landscape of Northern Ireland. 
He estimates that around 50% of Almac 
Diagnostics’ employees hold PhDs; and its 
parent company, the Almac Group, employs 
more than 3,000 people in the province. 
“There are now alternatives in the scientific 
arena in Northern Ireland — it’s not just jobs 
in academia,” he says. 

Harkin thinks that early-career researchers 
who want to make an impact should seriously 
consider accepting a position in industry. 
“Young scientists coming through don't 
understand the potential in industry,’ he says. 
“You may not own a project in its entirety, but 
youre part of that team that gets something 
into the clinic” 

Yet many scientists maintain that curiosity 
is enough to justify investigating a research 
question. Baas’ interest in sea-floor deposits 
is driven by a wonder at how the world works. 
“What motivates me is discovering things, 
really,’ he says. “I have questions in my head 
all the time; I want to find answers to those 
questions. Research is the ideal vehicle to do 
that. My work is my hobby.’ 

Ivancevic is set to begin another postdoc 
in August. She says that even if she had left 
academia, she would have stayed up to date 
with research in her field, and understands 
the drive of curiosity. “I can see how it can 
consume you,’ she says. “You just want to 
find out why,” 

Craig also expects to keep track of her 
academic field. “It’s almost like a hobby — it’s 
so cool and significant to the geoscience 
community,’ she says. “But I’m still drawn to 
other pursuits that apply my science.” m 


Jack Leeming is the editor of Naturejobs. 


INTERNATIONAL STUDENTS 


A shift in interest 


A report that gauges the preferred 
destinations of prospective students 
from around the world suggests that the 
United States and the United Kingdom 
are losing their appeal for students from 
some regions. ‘Applicant Survey 2018: 
What Drives an International Student 
Today?’ — conducted by London- 

based educational-marketing group 
Quacquarelli Symonds during the 
2016-17 academic year — finds that 
more students than before are aiming for 
Canada, Australia or elsewhere. Overall, 
48% of the 16,560 students surveyed listed 
the United States as one of their preferred 
destinations. The United Kingdom came 
in second at 42%, followed by Canada 

at 34%, and Australia and Germany 

at 28% each. The survey found that 
Canada had risen in popularity with 
prospective students from all regions, 
and had replaced the United Kingdom 

as the second most popular destination 
for respondents from Latin America and 
the Middle East and Africa. The United 
States had declined in popularity in some 
countries in Africa and the Middle East. 
The report speculates that the election of 
Donald Trump as US president and the 
UK Brexit vote might have influenced 
respondents’ indications of interest. 


UNIVERSITIES 


Fewer women at the top 


Female leadership at 200 of the 
top-ranked universities worldwide fell 

this year to 17%, according to a report. 

Just 34 of the universities named in the 
2018 Times Higher Education World 
University Rankings have female 
presidents, compared with 36 last year. 
Among the listing’s highest-ranked 
institutions across 27 nations are the 
University of Oxford, UK; Harvard 
University in Cambridge, Massachusetts; 
Imperial College London; the University 
of Pennsylvania in Philadelphia; and 

the University of California, Berkeley. The 
rankings consider research, teaching and 
international outlook among other factors. 
In Sweden, 4 of the 6 institutions that made 
the list are led by women. The United 
States has 11 female-led universities in 

the rankings, the report’s highest number. 
Janet Metcalfe, head of Vitae, a UK-based 
advocacy group for researchers, expressed 
concern at the figures. “More women in 
leadership positions provides positive 

role models for female academics,’ says 
Metcalfe, “and can encourage better gender 
balance and diversity at all levels.” 
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Some media did call it so, even if this 
alien didn’t come with guns a-blazing. 
Was he even male? No one could tell, but from 
the start the media defaulted to their per- 
ceived gender of an assumed conqueror. He 
came down from the sky in his craft of light — 
his pod, or capsule, or perhaps cocoon. With 
a smooth, effortless descent he landed in the 
Sahara Desert, ona stretch of dirt and yellow 
shards where millennia ago, before Ramses 
and Alexander, intense heat had turned sand 
to glass. And then the web erupted with 
triumphant screeches about the old gods 
returning to lead mankind to ascension. 

Only he didn't. 

He strolled through the streets of Cairo 
with the familiarity of someone born there. 
He didn't actually walk — he glided a couple 
of centimetres above the ground, clad in 
his ankle-long, shimmering garment that 
could bea robe or a kaftan, and browsed the 
stalls and booths of Khan el-Khalili just like 
another tourist. His dark, lidless eyes scanned 
fabrics and glassware, and perhaps lingered 
a little too long on the miniature cat-shaped 
statues displayed on the souq stalls before 
seeking the wide-eyed, living cats that had 
inspired them. 

Long fingers — an artist’s fingers, some 
argued, too delicate for warfare — brushed 
against spice racks, dried fruit and flat- 
bread amid other, non-edible wares. Then 
he stopped before a street-food cart and 
its selection of grilled meats: kofta, kebab, 
shawarma. Under the flabbergasted gaze of 
the petrified vendor, he broke off the tini- 
est piece of lamb meat and brought it to his 
mouth. Something that could be a smile lit 
up the expressionless face, and something 
that could be words left his thin lips. Then 
he stood there for a moment that stretched 
on, awaiting a response that never came, 
save from the vendor’s white-knuckled grip 
around his nazar amulet. 

And then he vanished. 

He appeared again sampling noodle soup 
in Hong Kong, then a shot of espresso in 
Rome, then in Peru and Mumbai and Singa- 
pore, and several other places around the 
world. He ventured through local markets, 
sampled their food and, some believed, 
tried to engage the locals in conversation. 
The media crews flocked behind him with 
every possible recording device, and they all 
failed. His garments emitted a force field that 


I s it an invasion if it’s only one alien? 
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rendered all electronics useless after a certain 
radius. Electronics and bullets. Because 


when the lunatics came — how could they 
not? — assigning human stupidity to one 
deity or another, no bullet or bomb affected 
his stroll through the eateries of Earth. 

And while theories on his origin and 
purpose raged, while the cooks and vendors 
he visited became celebrities overnight, his 
glow seemed to diminish with every new 
visit. Something did affect him. 

And then he was gone. No one saw him 
for two weeks, and many assumed hed left. 
Or died. 

Until he appeared again, this time on a 
backwater island of an indebted country in 
the Mediterranean. No markets this time, 
only an old fisherman cooking fish fresh 
from his nets, and the alien perched upona 
boulder a few metres away. The lone drone 
that caught this exchange recorded the alien's 
lips moving, and the fisherman, grey and 
withered like his boat, shaking his head. 

“Tthaca? No Ithaca,” he told the alien in 
broken English. A tourist is a tourist — 
what else would he speak? Then the old man 
pointed westwards. “This, Aegean. Ithaca, 
other sea. Ionian sea.” 

The old man poked the pile of embers 
and ash where something was roasting. He 
retrieved a package wrapped in aluminium 
foil and unfolded it. He held up a sardine 
enclosed in a thick salt crust. 

“Here. Try, phile? 

Phile. Friend. Did the alien eyes widen at 

the word? 
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Back to Odysseus.” 

“Odysseus,” said the alien and shattered 
the crust to reach the fish meat inside. He 
took one bite. “Good” 

As the old man reached for a second sar- 
dine, a woman emerged from the little house 
behind them, with the whitewashed walls 
and the blue windows. The drones that had 
swarmed to the area ran her face through 
every possible database: a retired school- 
teacher, the fisherman’s wife, who once 
taught an ancient, useless language to bored 
teenagers. Just another nobody in the long 
string of nobodies the alien had approached. 
She came bearing gifts of alcohol, and the 
alien mumbled something to her. 

A moment of wide-eyed silence, then a 
slow shake of her head. “No. Not oinos kekra- 
menos. We don't drink that anymore. Here. 
Try that. Ouzo. Good. Better than ambrosia.” 

If doubts slowed the alien’s initial gulps 
of the star-anise-smelling liquor, they were 
washed down with the second bottle and the 
sardines that accompanied it. Then other 
dishes came out: olives and grilled peppers 
and hard bread softened with olive oil and 
topped with goat cheese. 

“Not your first time here?” she asked the 
alien. 

He licked the oil from his fingers and 
nodded. 

“Long ago?” 

Another nod. 

“Why now?” 

A long stare. He held out his open palm. 
The visage of an ancient copper drachma 
flickered on it. His palm closed a little too 
slow, as if in pain. 

She sighed, and nodded, and offered him 
more ouzo. “One more for the journey. And 
for the Ferryman.” 

Beneath the crescent Moon, three old- 
timers ate and drank their night away, two of 
them counting their lifespans in decades, the 
third in millennia. When dawn came, they 
were gone. Only then did humankind realize 
how much the alien craft had looked like a 
coffin. But they never found it to confirm it, 
nor did they find the old couple, and soon 
they were all forgotten. 

Until another ship came. = 
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